FIGURE 1 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 

CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 

CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 

AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 

TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGGAGGCACAGGTGGCCCCCACCACCCGGAGG 

AGCAGCTCCTGCCCCTGTCCGGGGGA^CGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGA 

AGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCT 

TCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTTAGGGTGTGTGCT 

GTCCCGGGCTCACGGGGACCCTGTCTGCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCC 

TCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATTTATAGGACCGCCTAC 

CGCCGCAGCCCTGGGCTGGCCCCTGCCIAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAG 

GACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAG 

GGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAG 

TCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGC 

CGGCAGTTACTGGTGCCAGTGTTGGG^^GGGGCACAGCCTGTCTGCAGACGGTACACTCTGTG 

TGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAG 

GAAGAAGTGCAGAG6CTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCT 

GGCCCCACTGCACAGCCTGGCCTCGC:a.GGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCC 

TCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTC 

CTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGG 

CTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTC 

CAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTTTTCCTCCTC 

CCCTTCCCTCGGGAGGGTCCCCAGACCCTGGCATGGGATGGGCTGGGATTTTTTTTGTGAAT 

CCAGCCCTGGCTACCCCCACCCTGGTIACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCA 

GCTGAGGGAAGGTACGAGTTCCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCC 

CGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAA 

<^GTGAAAAAAAAAAAAAAAAAAAAAAJiAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCT 

AGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGT 
TACAAAT 



FIGURE 2 



MTDSPPPGHPEEKATPPGGTGHEGLSGGAADVASGVGSGRHRARLPARPLGCVLSRAHGDPV 
SESFVQRVYQPFLTTCDGHRACSTYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLEGAC 
GAAICQPPCRNGGSCVQPGRCRCPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCW 
EGHSLSADGTLCVPKGGPPRVAPNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLAS 
QALEHGLPDPGSLLVHSFQQLGRIDSLSEQISFLEEQLGSCSCKKDS 

Signal sequence : 

amino acids 1-19 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 93-97, 270-274 

N-myristoylation sites. 

amino acids 19-25, 78-84, 97-103, 100-106, 103-109, 157-163, 
191-197, 265-271 

Amidation site. 

amino acids 26-30 

Aspartic acid and asparagine hydroxylation site. 

amino acids 152-164 

Cell attachment sequence. 

amino acids 130-133 

EGF-like domain cysteine pattern signature. 

amino acids 123-135 



FIGURE 3 

CGCTCGCCCCGTCGCCCCTCGCCTC:CCCGCAGAGTCCCCTCGCGGCAGCAGATGTGTGTGGG 
GTCAGCCCACGGCGGGGACTATGGTGAAATTCCCGGCGCTCACGCACTACTGGCCCCTGATC 
CGGTTCTTGGTGCCCCTGGGCATCACCAACATAGCCATCGACTTCGGGGAGCAGGCCTTGAA 
CCGGGGCATTGCTGCTGTCAAGGAGJGATGCAGTCGAGATGCTGGCCAGCTACGGGCTGGCGT 
ACTCCCTCATGAAGTTCTTCACGGGTCCCATGAGTGACTTCAAAAATGTGGGCCTGGTGTTT 
GTGAACAGCAAGAGAGACAGGACCJW^GCCGTCCTGTGTATGGTGGTGGCAGGGGCCATCGC 
TGCCGTCTTTCACACACTGATA6CTTATAGTGATTTAGGATACTACATTATCAATAAACTGC 
ACCATGTGGACGAGTCGGTGGGGAGCAAGACGAGAAGGGCCTTCCTGTACCTCGCCGCCTTT 
CCTTTCATGGACGCAATGGCATGGACCCATGCTGGCATTCTCTTAAAACACAAATACAGTTT 
CCTGGTGGGATGTGCCTCAATCTCAGATGTCATAGCTCAGGTTGTTTTTGTAGCCATTTTGC 
TTCACAGTCACCTGGAATGCCGGGAGCCCCTGCTCATCCCGATCCTCTCCTTGTACATGGGC 
GCACTTGTGCGCTGCACCACCCTGTGCCTGGGCTACTACAAGAACATTCACGACATCATCCC 
TGACAGAAGTGGCCCGGAGCTGGGC5GGAGATGCAACAATAAGAAAGATGCTGAGCTTCTGGT 
GGCCTTTGGCTCTAATTCTGGCCACACAGAGAATCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCAGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGAGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTI'CTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGJkAGAAAACCTTCGTCCTTGCCCCCAGCTCTGTGCTGCG 
GATCATCGTCCTCATCGCCAGCCTCGTGGTCCTACCCTACCTGGGGGTGCACGGTGC6ACCC 
TGGGCGTGGGCTCCCTCCTGGCGGGCTTTGTGGGAGAATCCACCATGGTCGCCATCGCTGCG 
TGCTATGTCTACCGGAAGCAGAAAiiAGAAGATGGAGAATGAGTCGGCCACGGAGGGGGAAGA 
CTCTGCCATGACAGACATGCCTCCGfACAGAGGAGGTGACAGACATCGTGGAAATGAGAGAGG 
AGAATGAATAAGGCACGGGACGCCA.TGGGCACTGCAGGGACGGTCAGTCAGGATGACACTTC 
GGCATCATCTCTTCCCTCTCCCATCGTATTTTGTTCCCTTTTTTTTGTTTTGTTTTGGTAAT 
GAAAGAGGCCTTGATTTAAAGGTTTCGTGTCAATTCTCTAGCATACTGGGTATGCTCACACT 
GACGGGGGGACCTAGTGAATGGTCI'TTACTGTTGCTATGTAAAAACAAACGAAACAACTGAC 
TTCATACCCCTGCCTCACGAAAACCCAAAAGACACAGCTGCCTCACGGTTGACGTTGTGTCC 
TCCTCCCCTGGACAATCTCCTCTTGGAACCAAAGGACTGCAGCTGTGCCATCGCGCCTCGGT 
CACCCTGCACAGCAGGCCACAGACI'CTCCTGTCCCCCTTCATCGCTCTTAAGAATCAACAGG 
TTAAAACTCGGCTTCCTTTGATTTGCTTCCCAGTCACATGGCCGTACAAAGAGATGGAGCCC 
CGGTGGCCTCTTAAATTTCCCTTCTGCCACGGAGTTCGAAACCATCTACTCCACACATGCAG 
GAGGCGGGTGGCACGCTGCAGCCCGGAGTCCCCGTTCACACTGAGGAACGGAGACCTGTGAC 
CACAGCAGGCTGACAGATGGACAG^^ATCTCCCGTAGAAAGGTTTGGTTTGAAATGCCCCGGG 
GGCAGCAAACTGACATGGTTGAATGATAGCATTTCACTCTGCGTTCTCCTAGATCTGAGCAA 
GCTGTCAGTTCTCACCCCCACCGTGTATATACATGAGCTAACTTTTTTAAATTGTCACAAAA 
GCGCATCTCCAGATTCCAGACCCTGCCGCATGACTTTTCCTGAAGGCTTGCTTTTCCCTCGC 
CTTTCCTGAAGGTCGCATTAGAGCGAGTCACATGGAGCATCCTAACTTTGCATTTTAGTTTT 
TACAGTGAACTGAAGCTTTAAGTCT'CATCCAGCATTCTAATGCCAGGTTGCTGTAGGGTAAb 
TTTTGAAGTAGATATATTACCTGGTTCTGCTATCCTTAGTCATAACTCTGCGGTACAGGTAA 
TTGAGAATGTACTACGGTACTTCCCTCCCACACCATACGATAAAGCAAGACATTTTATAACG 
ATACCAGAGTCACTATGTGGTCCTCCCTGAAATAACGCATTCGAAATCCATGCAGTGCAGTA 
TATTTTTCTAAGTTTTGGAAAGCAGGTTTTTTCCTTTAAAAAAATTATAGACACGGTTCACT 
AAATTGATTTAGTCAGAATTCCTAGACTGAAAGAACCTAAACAAAAAAATATTTTAAAGATA 
TAAATATATGCTGTATATGTTATGTAATTTATTTTAGGCTATAATACATTTCCTATTTTCGC 
ATTTTCAATAAAATGTCTCTAATACJyVAAAA 



FIGURE 4 

MVKFPALTHYWPLIRFLVPLGITNIAIDFGEQALNRGIAAVKEDAVEMLASYGLAYSLMKFF 
TGPMSDFKNVGLVFVNSKRDRTKAVLCMWAGAI AAVFHTL lAYSDLGYY I INKLHHVDES V 
GSKTRRAFLYIJ^PF]ynDAMAWTHfi.GILLKHKYSFLVGCASISDVIAQVVFVAILLHSHLEC 
REPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLALIL 
ATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEIRAVYPAFDKNNPSN 
KLVSTSNTVTAAHIKKFTFVCMALSLTLCFVMFWTPNVSEKILIDIIGVDFAFAELCWPLR 
IFSFFPVPVTVRAHLTGWLMTLKKTFVLAPSSVLRIIVLIASLWLPYLGVHGATLGVGSLL 
AGFVGESTIWAIAACYVYRKQKiaayiENESATEGEDSAMTDMPPTEEVTDIVEMREENE 

Transmembrane domains : 

amino acids 86-106, 163-179, 191-205, 237-253, 327-343, 357-374, 
408-423, 431-445 



FIGURE 5 



CCTGACAGAAGTGCCCCGGAGCTGGGtGGAGATNCAACATTAAGAAGATGCTGAGCTTCTGGT 
GCCNTTTGGCTCTAATTCTGGCCACi^CAGAGAANCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCGGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCITCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGNGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTC 



FIGURE 6 



TGACGGAATCCCGGGCTGGGTATCCTGGTTTNGACAAGATAAACCCCCAGCAANAAATTGGG 
GAGCAGGGCAAAACAGTNACGGGCAGCCCACATCAAGAAGTTCACCTTNGTTTGNATGGNTC 
TGTCAACTCACGCTNTGTTTCGTGATGTTTTGGACACCCAAAGTGTTTGAGAAAATTTTGAT 
AGACATNATCGGAGTGGANTTTGCCTTTGCAGAANTTTGNGNTGTTCCTTTGCGGATTTTCT 
CCTTTTTCCCAGTTCCAGTCACAGNGAGGGCGCATCTCACCGGGNGGNTGATGACANTGAAG 
AAAACCTTTGTCCTTGCCCCCAGCTNTTTGGTGCGGATCATTGTCCTNATNGCCAGCCTTGT 
GGTCCTACCCTACCTGGGGGTGCACGGTGCGACCCTGGGCGTGGGTTCCCTCCTGGCGGGCA 



FIGURE 7 



TATTCCCAGTTCCGGTCACGGGGAGIGGCGCATNTCACCGGGTGGCTGANGACACTGAAGAAA 
ACCTTNGTCCTTGCCCCCAGNTTTGTGNTGCGGATNATCGTCCTCATCGCCAGCCTNGTGGT 
CCTACCCTACCTGGGGGTGCACGGTGAGAC 



FIGURE 8 



gccccgcgcccggcgccgggcgcccgaagccgggagccaccgccatgggggcctgcctggga 
gcctgctccctgctcagctgcgcgtcctgcctctgcggctctgccccctgcatcctgtgcag 
ctgctgccccgccagccgcaactcciaccgtgagccgcctcatcttcacgttcttcctcttcc 
tgggggtgctggtgtccatcattatgctgagcccgggcgtggagagtcagctctacaagctg 
ccctgggtgtgtgaggagggggccgggatccccaccgtcctgcagggccacatcgactgtgg 
ctccctgcttggctaccgcgctgtctaccgcatgtgcttcgccacggcggccttcttcttct 
tctttttcaccctgctcatgctctgcgtgagcagcagccgggacccccgggctgccatccag 
aatgggttttggttctttaagttcc:tgatcctggtgggcctcaccgtgggtgccttctacat 
ccctgacggctccttcaccaacatctggttctacttcggcgtcgtgggctccttcctcttca 
tcctcatccagctggtgctgctcatcgactttgcgcactcctggaaccagcggtggctgggc 
aaggccgaggagtgcgattcccgtgcctggtacgcaggcctcttcttcttcactctcctctt 
ctacttgctgtcgatcgcggccgtggcgctgatgttcatgtactacactgagcccagcggct 
gccacgagggcaaggtcttcatcagcctcaacctcaccttctgtgtctgcgtgtccatcgct 
gctgtcctgcccaaggtccaggacgcccagcccaactcgggtctgctgcaggcctcggtcat 

CACCCTCTACACCATGTTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCA 
ACCCCCATTTGCCAACCCAGCTGGGfCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAG 
ACCCAGTGGTGGGATGCCCCGAGC^ITTGTGGGCCTCATCATCTTCCTCCTGTGCACCCTCTT 
CATCAGTCTGCGCTCCTCAGACCACCGGCAGGTGAACAGCCTGATGCAGACCGAGGAGTGCC 
CACCTATGCTAGACGCCACACAGCPiGCAGCAGCAGCAGGTGGCAGCCTGTGAGGGCCGGGCC 
TTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCCACTTCTGCCTGGTGCT 
GGCCTCACTGCACGTCATGATGACGCTCACCi^CTGGTACAAGCCCGGTGAGACCCGGAAGA 
TGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCTCCTC 
TACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCT 
CACAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCT 
GCCCCCTCCCCACACCAATCAGCC;i.GGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTG 
CCCCTGAGCCGGGCCTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCA 
GAGCCCCATCCCCCCGCCACACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGT 
TGCCCATACTCAGCATCTCGGATGAAAGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGG 
CTGCTGGAGAGAGCGGGGAACTCCCACCACAGTGGGGCATCCGGCACTGAAGCCCTGGTGTT 
CCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTGGACTTCGTGCCTTACTGAGTCTCT 
AAGACTTTTTCTAATAAACAAGCCAC3TGCGTGTAAAAAAAA 



FIGURE 9 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVE 
SQLYKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLIiMLCVSSSRD 
PRAAIQNGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSW 
NQRWLGKAEECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFC 
VCVS I AAVLPKVQDAQPNSGLLQASVITLYTMFVTWSALS S I PEQKCNPHLPTQLGNETWA 
GPEGYETQWWDAPS I VGL 1 1 FLLCTLFI SLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVA 
ACEGRAFDNEQDGVTYS YS FFHFCLVLAS LHVmTLTlSnWYKPGETRKMI STWTAVWVKI CAS 
WAGLLLYLWTLVAPLLLRNRDFS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116;, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 



FIGURE 10 



GAGCGAGGCCGGGGACTGAAGGTGT(3GGTGTCGAGCCCTCTGGCAGAGGGTTAACCTGGGTC 
AAATGCACGGATTCTCACCTCGTACAGTTACGCTCTCCCGCGGCACGTCCGCGAGGACTTGA 
AGTCCTGAGCGCTCAAGTTTGTCCGTAGGTCGAGAGAAGGCCATGGAGGTGCCGCCACCGGC 
ACCGCGGAGCTTTCTCTGTAGAGCATTGTGCCTATTTCCCCGAGTCTTTGCTGCCGAAGCTG 
TGACTGCCGATTCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCC 
TATTACCCGGAATCTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAG 
AATTTCAAAGGACCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGG 
TGTATGGGGGAATACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCA 
GAAATTTATCATAACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTT 
CATTCGTTATGGCTGGCGCTGGGGTTGGAGAACTGCAGTGTTTGTGACTATATTCAACACAG 
TGAACACTAGTCTGAATGTATACCGJAATAAAGATGCCTTAAGCCATTTTGTAATTGCAGGA 
GCTGTCACGGGAAGTCTTTTTAGGATAAACGTAGGCCTGCGTGGCCTGGTGGCTGGTGGCAT 
AATTGGAGCCTTGCTGGGCACTCCTGTAGGAGGCCTGCTGATGGCATTTCAGAAGTACGCTG 
GTGAGACTGTTCAGGAAAGAAliACAGAAGGATCGAAAGGCACTCCATGAGCTAAAACTGGAA 
GAGTGGAAAGGCAGACTACAAGTTACTGAGCACCTCCCTGAGAAAATTGAAAGTAGTTTACG 
GGAAGATGAACCTGAGAATGATGCTiiLAGAAAATTGAAGCACTGCTAAACCTTCCTAGAAACC 
CTTCAGTAATAGATAAACAAGACAAGGACTGAAAGTGCTCTGAACTTGAAACTCACTGGAGA 
GCTGAAGGGAGCTGCCATGTCCGATGAATGCCAACAGACAGGCCACTCTTTGGTCAGCCTGC 
TGACAAATTTAAGTGCTGGTACCTGTGGTGGCAGTGGCTTGCTCTTGTCTTTTTCTTTTCTT 
TTTAACTAAGAATGGGGCTGTTGTACTCTCACTTTACTTATCCTTAAATTTAAATACATACT 
TATGTTTGTATTAATCTATCAATATA.TGCATACATGGATATATCCACCCACCTAGATTTTAA 
GCAGTAAATAAAACATTTCGCAAAAGATTAAAGTTGAATTTTACAGTTT 



FIGURE 11 

></ usr/seqdb2 /ss t/DNA/Dnaseqs . min/ss . DNA2 3318 
xsubunit 1 of 1, 285 aa, 1 stop 
xMW: 32190, pi: 9.03, NX(S/T): 2 

MEVPPPAPRSFLCRALCLPPRVFAAEAVTADSEVLEERQKRLPYVPEPYYPESGWDRLRELF 
GKDEQQRISKDLANICKTAATAGIIGWVYGGIPAFIHAKQQYIEQSQAEIYHNRFDAVQSAH 
RAATRGFIRYGWRWGWRTAVFWIFNTVNTSLNVYRNKDALSHFVIAGAVTGSLFRINVGLR 
GLVAGGIIGALLGTPVGGLLMAFQKYAGETVQERKQKDRKALHELKLEEWKGRLQVTEHLPE 
KIESSLREDEPENDAKKIEALLNLPI^PSVIDKQDKD 

Important Features: 
Signal Peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 76-96 and 171-195 

N-glycosylatibn site: 

amino acids 153-156 



FIGURE 12 

CGGAAGTCCCTTGAGGAGCGTCAGAiiGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATCTGGATGGGACCGCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGGA 
CCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGAA 
TACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATCAT 
AACCGGTTTGATGCTGTGCAATCTGtCACATCGTGCTGCCACACGAGGCTTCATTCGTTCATG 
GCTGGCGCCGAACC 



TCAAGTTTGTCCGTAGGTCGAGAGiSAGGCCATGGAGGTGCCGCCACCGGCACCGCGGAGCTT 

TTTTCTGTAGAGCATTGTGCCTATTTCCCCGAGTTTTTGCTGCCGAAGCTGTGACTGCCGAT 

TCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 

ATTTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGG 

ACCTTGCTGATATNTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGA 

ATACCAGCTTTTATTCATGNTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATNA 
TAACC 



FIGURE 14 



GAGCCGCCGCCGCGCGCGCGCCGCGCACTGCAGCCCCAGGCCCCGGCCCCCCACCCACGTCT 
GCGTTGCTGCCCCGCCTGGGCCAGGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCT 
CCGCCGGAGTCGAATTTACGTGCAGCTGCCGGCAACCACAGGTTCCAA GATGG TTTGCGGGG 
GCTTCGCGTGTTCCAAGAACTGCCTGTGCGCCCTCAACCTGCTTTACACCTTGGTTAGTCTG 
CTGCTAATTGGAATTGCTGCGTGGGGCATTGGCTTCGGGCTGATTTCCAGTCTCCGAGTGGT 
CGGCGTGGTCATTGCAGTGGGCATCTTCTTGTTCCTGATTGCTTTAGTGGGTCTGATTGGAG 

GTTCAGTTTTCTGTATCTTGCGCTTGTTTAGCCCTGAACCAGGAGCAACAGGGTCAGCTTCT 
GGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAATCTAAACTGCT 
GTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTCTGGCTAGCTGTGTTAAAAGTGACCAC 
TCGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGAGATTTGTTGG 
TGGCATTGGCCTGTTCTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACCTACAGATACA 
GGAACCAGAAAGACCCCCGCGCGAATCCTAGTGCATTCCTTTGATGAGAAAACAAGGAAGAT 
TTCCTTTCGTATTATGATCTTGTTCACTTTCTGTAATTTTCTGTTAAGCTCCATTTGCCAGT 
TTAAGGAAGGAAACACTATCTGGAAAAGTACCTTATTGATAGTGGAATTATATATTTTTACT 

TGAAGCTCGGTGGCACCTGGAATTTACTGTATTCATTGTCGGGCACTGTCCACTGTGGCCTT 
TCTTAGCATTTTTACCTGCAGAAAAACTTTGTATGGTACCACTGTGTTGGTTATATGGTGAA 
TCTGAACGTACATCTCACTGGTATA?^TTATATGTAGCACTGTGCTGTGTAGATAGTTCCTAC 
TGGAAAAAGAGTGGAAATTTATTAAAATCAGAAAGTATGAGATCCTGTTATGTTAAGGGAAA 
TCCAAATTCCCAATTTTTTTTGGTCTTTTTAGGAAAGATTGTTGTGGTAAAAAGTGTTAGTA 
TAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAAATAGTTAT 
GTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTGGTTT 
CATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCATCAGAATGGAACGAGTTT 
TGAGTAATCAGGAAGTATATCTATATGATCTTGATATTGTTTTATAATAATTTGAAGTCTAA 
AAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGATATTTG 
ATTATCTTAA2^TTGTTAAATACCGTTTTCATGAAATTTCTCAGTATTGTAACAGCAACTT 
GTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATTGTGTG 
GCTCTGTATATTCTGTTAAAAAATTJIVAAGGACAGAAACCTTTCTTTGTGTATGCATGTTTGA 
ATTAAAAGAAAGTAATGGAAG 



FIGURE 15 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA39979 
xsubunit 1 of 1, 204 aa, 1 stop 
xMW: 22147, pi: 8.37, NX(S/T): 3 

MVCGGFACSKNCLCALNLLYTLVSLLLIGIAAWGIGFGLISSLRWGWIAVGIFLFLIALV 
GLIGAVKHHQVLLFFYMIILLLVFIVQFSVSCACLALNQEQQGQLLEVGWNNTASARNDIQR 
NLNCCGFRSVNPNDTCLASCVKSDHSCSPCAPIIGEYAGEVLRFVGGIGLFFSFTEILGVWL 
TYRYRNQKDPRANPSAFL 

Signal Peptide: 

amino acids 1-34 

Transmembrane domains: 

amino acids 47-63, 72-95 and 162-182 



FIGURE 16 

TGATTGGAGCTGTAAAAAANTCTTCJLGGTGTTGTNATTTTTTTATATGATTATTCTGTAANT 
TGTATTTATTGTTCAGTTTTNTGTATCTTGCGCTTGTTTAGCCNTGAACCAGGAGCAACAGG 
GTCAGNTTNTGGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAAT 
NTAAACTGCTGTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTNTGGCTAGCTGTGTTAA 
AAGTGACCACTNGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGA 
GATTTGTTGGTGGCATTGGCCTGTTNTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACC 
TACAGATACAGGAACCAG 



AATCCCAAATTCCCCAATTTTTTTGGNCTTTTTAG6GAAAGATGTGTTGTGGTAAAAAGTGT 
TAGTATAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAATAG 
TTATGTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTG 
GTTTCATGAAATGTTCTAATGTAT^ATAACATTTACCTTCAGCCTCCCATCAGAATGGAACG 
AGTTTTGAGTAATCCAGGAAGTATATCTATATGATCTTGATATTGTTTTATATAATTTGAAG 
TCTAAAAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGAT 
ATTTGATTATCTTAAAAATTGTTAi^ATACCGTTTTCATGAAAGTTCTCAGTATTGTAACAGC 

aacttgtcaaacctaagcatatttg;!^tatgatctcccataatttgaaattgaaatcgtatt 

GTGTGGAGGAAATGGCAATCTTATGTGTGCTGAAGGACACAGTAAGAGCACCAAGTTGTGCC 
CCACTTGC 



FIGURE 18 

ATGATTATTCTGTTACTTGTATTTATTGTTCAGTTTTATGGTATCTTGCGCTTGTTTAGCCC 
CTGAAACCAGGAGCAACAGGGNNCAGCTTCCTGGAGGTTGGTTGGCAACAATCACGGCCAAG 
TGACTCCGCAAATGACATCCCAGAGAAATCCTAAACTGCTGTGGGTTCCGAAGTGTTAACCC 
AAATGACACCTGTCTGGCTNGCTGTGTTAAAAGTGACCACTCGTGCTCGCCATGTGCTCCAA 
TCATAGGAGAATATGC 



msnrimf 



B^IGURE 19 

CAGTCACCATGAAGCTGGGCTGTGTCCTCATGGCCTGGGCCCTCTACCTTTCCCTTGGTGTG 
CTCTGGGTGGCCCAGATGCTACTGGJCTGCCAGTTTTGAGACGCTGCAGTGTGAGGGACCTGT 
CTGCACTGAGGAGAGCAGCTGCCACACGGAGGATGACTTGACTGATGCAAGGGAAGCTGGCT 
TCCAGGTCAAGGCCTACACTTTCAGJTGAACCCTTCCACCTGATTGTGTCCTATGACTGGCTG 
ATCCTCCAAGGTCCAGCCAAGCCAGTTTTTGAAGGGGACCTGCTGGTTCTGCGCTGCCAGGC 
CTGGCAAGACTGGCCACTGACTCAGGTGACCTTCTACCGAGATGGCTCAGCTCTGGGTCCCC 
CCGGGCCTAACAGGGAATTCTCCAa'CACCGTGGTACAAAAGGCAGACAGCGGGCACTACCAC 
TGCAGTGGCATCTTCCAGAGCCCTGGTCCTGGGATCCCAGAAACAGCATCTGTTGTGGCTAT 
CACAGTCCAAGAACTGTTTCCAGCGCCAATTCTCAGAGCTGTACCCTCAGCTGAACCCCAAG 
CAGGAAGCCCCATGACCCTGAGTTGTCAGACAAAGTTGCCCCTGCAGAGGTCAGCTGCCCGC 
CTCCTCTTCTCCTTCTACAAGGATGSGAAGGATAGTGCAAAGCAGGGGGCTCTCCTCAGAATT 
CCAGATCCCCACAGCTTCAGAAGATCACTCCGGGTCATACTGGTGTGAGGCAGCCACTGAGG 
ACAACCAAGTTTGGAAACAGAGCCCCCAGCTAGAGATCAGAGTGCAGGGTGCTTCCAGCTCT 
GCTGCACCTCCCACATTGAATCCAGCTCCTCAGAAATCAGCTGCTCCAGGAACTGCTCCTGA 
GGAGGCCCCTGGGCCTCTGCCTCCGCCGCCAACCCCATCTTCTGAGGATCCAGGCTTTTCTT 
CTCCTCTGGGGATGCCAGATCCTCATCTGTATCACCAGATGGGCCTTCTTCTCAAACACATG 
CAGGATGTGAGAGTCCTCCTCGGTCACCTGCTCATGGAGTTGAGGGAATTATCTGGCCACCA 
GAAGCCTGGGACCACAAAGGCTACTGCTGAATAGAAGTAAACAGTTCATCCATGATCTCACT 
TAACCACCCCAATAAATCTGATTCTTTATTTTCTCTTCCTGTCCTGCACATATGCATAAGTA 
CTTTTACAAGTTGTCCCAGTGTTTTGTTAGAATAATGTAGTTAGGTGAGTGTAAATAAATTT 
ATATAAAGTGAGAATTAGAGTTTAGCTATAATTGTGTATTCTCTCTTAACACAACAGAATTC 
TGCTGTCTAGATCAGGAATTTCTATCTGTTATATCGACCAGAATGTTGTGATTTAAAGAGAA 
CTAATGGAAGTGGATTGAATACAGCAGTCTCAACTGGGGGCAATTTTGCCCCCCAGAGGACA 
TTGGGCAATGTTTGGAGACATTTTGGTCATTATACTTGGGGGGTTGGGGGATGGTGGGATGT 
GTGTCTACTGGCATCCAGTAAATAG^AAGCCAGGGGTGCCGCTAAACATCCTATAATGCACAG 
GGCAGTACCCCACAACGAAAAATAATCTGGCCCAAAATGTCAGTTGTACTGAGTTTGAGAAA 
CCCCAGCCTAATGAAACCCTAGGTGTTGGGCTCTGGAATGGGACTTTGTCCCTTCTAATTAT 
TATCTCTTTCCAGCCTCATTCAGCTATTCTTACTGACATACCAGTCTTTAGCTGGTGCTATG 
GTCTGTTCTTTAGTTCTAGTTTGTA.TCCCCTCAAAAGCCATTATGTTGAAATCCTAATCCCC 
AAGGTGATGGCATTAAGAAGTGGGCCTTTGGGAAGTGATTAGATCAGGAGTGCAGAGCCCTC 
ATGATTAGGATTAGTGCCCTTATTT.ZIAAAAGGCCCCAGAGAGCTAACTCACCCTTCCACCAT 
ATGAGGACGTGGCAAGAAGATGACATGTATGAGAACCAAAAAACAGCTGTCGCCAAACACCG 
ACTCTGTCGTTGCCTTGATCTTGAACTTCCAGCCTCCAGAACTATGAGAAATAAAATTCTGG 
TTGTTTGTAGCCTAA 



FIGURE 20 



></usr/seqdb2/sst/DNA/Dna£3eqs .min/ss .DNA40594 
><s\ibunit 1 of 1, 359 aa, 1 stop 
xMW: 38899, pi: 5.21, NX [S/T) : 0 

MKLGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREAGFQV 
KAYTFSEPFHLIVSYDWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGP 
NREFSITWQKADSGHYHCSGIFQSPGPGIPETASWAITVQELFPAPILRAVPSAEPQAGS 
PMTLSCQTKLPLQRSAARLLFSFYKDGRIVQSRGLSSEFQIPTASEDHSGSYWCEAATEDNQ 
VWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSSPL 
GMPDPHLYHQMGLLLKHMQDVR.VLLGHLLMELRELSGHQKPGTTKATAE 

Signal seqpience: 
amino acids 1-17 

Leucine zipper pattern secpience: 

amino acids 12-33 

Protein kinase C phosphorj'lation site; 

amino acids 353-355 



CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAGTT 
TGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGACAGAAGTAGCTCTGGCTGTGATGGGG 
ATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGACACTTATGGCCGTCCCATCCT 
GGAAGTGCCAGAGAGTGTAACAGGACCTTGGAAAGGGGATGTGAATCTTCCCTGCACCTATG 
ACCCCCTGCAAGGCTACACCCAAGTCTTGGTGAAGTGGCTGGTACAACGTGGCTCAGACCCT 
GTCACCATCTTTCTACGTGACTCTTCTGGAGACCATATCCAGCAGGCAAAGTACCAGGGCCG 
CCTGCATGTGAGCCACAAGGTTCCAGGAGATGTATCCCTCCAATTGAGCACCCTGGAGATGG 
ATGACCGGAGCCACTACACGTGTGAillGTCACCTGGCAGACTCCTGATGGCAACCAAGTCGTG 
AGAGATAAGATTACTGAGCTCCGTGTCCAGAAACTCTCTGTCTCCAAGCCCACAGTGACAAC 
TGGCAGCGGTTATGGCTTCACGGTGCCCCAGGGAATGAGGATTAGCCTTCAATGCCAGGCTC 
GGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAACAGACTAATAACCAGGAACCCATC 
AAAGTAGCAACCCTAAGTACCTTACTCTTCAAGCCTGCGGTGATAGCCGACTCAGGCTCCTA 
TTTCTGCACTGCCAAGGGCCAGGTTGGCTCTGAGCAGCACAGCGACATTGTGAAGTTTGTGG 
TCAAAGACTCCTCAAAGCTACTCAAGACCAAGACTGAGGCACCTACAACCATGACATACCCC 
TTGAAAGCAACATCTACAGTGAAGCAGTCCTGGGACTGGACCACTGACATGGATGGCTACCT 
TGGAGAGACCAGTGCTGGGCCAGGAi\AGAGCCTGCCTGTCTTTGCCATCATCCTCATCATCT 
CCTTGTGCTGTATGGTGGTTTTTACCATGGCCTATATCATGCTCTGTCGGAAGACATCCCAA 
CAAGAGCATGTCTACGAAGCAGCCAGGTAAGAAAGTCTCTCCTCTTCCATTTTTGACCCCGT 
CCCTGCCCTCAATTTTGATTACTGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCACAGACCC 
AATCCTAAGGCCGGAGGCCTTCAGGGTCAGGACATAGCTGCCTTCCCTCTCTCAGGCACCTT 
CTGAGGTTGTTTTGGCCCTCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCC 
AGAATCCCTGGGTGGTAGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGA 
AACCAGGACCACAGCCCCAAGTCCCTTCTTATGGGTGGTGGGCTCTTGGGCCATAGGGCACA 
TGCCAGAGAGGCCAACGACTCTGGAC3AAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCT 
CCAGTGATGAGCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGA 
CAGGAGTACCAGATCATCGCCCAGATCAATGGCAACTACGCCCGCCTGCTGGACACAGTTCC 
TCTGGATTATGAGTTTCTGGCCACTGfAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGC 
CAGGATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCT 
ACCTCTCTTCCTGGATAGCCCAAAGTGTCCGCCTACCAACACTGGAGCCGCTGGGAGTCACT 
GGCTTTGCCCTGGAATTTGCCAGATGCATCTCAAGTAAGCCAGCTGCTGGATTTGGCTCTGG 
GCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAAGATG 
CCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATCTTGCCA 
CCAGAAGACCCGAGGGAGGCTCAGCTCTGCCAGCTCAGAGGACCAGCTATATCCAGGATCAT 
TTCTCTTTCTTCAGGGCCAGACAGCTTTTAATTGAAATTGTTATTTCACAGGCCAGGGTTCA 
GTTCTGCTCCTCCACTATAAGTCTAiiiTGTTCTGACTCTCTCCTGGTGCTCAATAAATATCTA 



FIGURE 22 



></usr/seqdb2/sst/DNA/Dna.seqs .min/ss .DNA45416 
xsubiiiiit 1 of 1, 321 aa, 1 stop 
xMW: 35544, pi: 8.51, NX (S/T) : 0 

MGI LLGLLLLGHLTVDTYGRP I LEVPES VTGPWKGDVNLPCTYDPLQGYTQVLVKWLVQRGS 
DPVTIFLRDSSGDHIQQAKYQGRLKCVSHKVPGDVSLQLSTLEMDDRSHYTCEVTWQTPDGNQ 
VVRDKITELRVQKLSVSKPTVTTGSGYGFTVPQGMRISLQCQARGSPPISYIWYKQQTNNQE 
PIKVATLSTLLFKPAVIADSGSYFCTAKGQVGSEQHSDIVKFWKDSSKLLKTKTEAPTTMT 
YPLKATSTVKQSWDWTTDMDGYLGETSAGPGKSLPVFAIILIISLCCMWFTMAYIMLCRKT 
SQQEHVYEAAR 

Signal Sequence; 

amino acids 1-19 

Glycosaminoglycan attachment site: 

amino acids 149-152 

Transmembrane domain: 

amino acids 2 82-300 



FIGURE 23 



GCGCCGGGAGCCCATCTGCCCCCA(3GGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACAT 
GGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGA 
GGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATC 
GGGATGTCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCA 
CACTGAGATCAAGAGAGTGGCAGAC^AAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGC 
TTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAA 
GTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCG 
AGTGGCCTTTGCTTCCAATTTCCT(3GCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGC 
CCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCAT 
GTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTT6GAAGGAGAGCTGAC 
AGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATT 
ACTGGCAGCGAATCCGAGAGAAA6AGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATT 
GACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTA 
CCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGT 
ATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTG 
ATTTTCCTCTTGGTGTGGCTGCTAJ^TCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGA 
GAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCT 
CCTCTTCCTCAGGCTCTCGGAGCTCIACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAAT 
AGTGCCTCACGCAGCCAGCGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCAC 
CCAGGCATACAGCCTAGTGGGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATG 
CTAATCTGACCAAAGCAGAAACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAA 
ACGGTCTGAATTACAATGGACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTC 
TTCTCGTCATTGGAGCTCAAGTCACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCA 
GTGAGCATTGCACGGAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAA 
AGGATGTAAGCTGATTCATCTGTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGG 
AAAGCAGGAGTCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTG 
AGGTGAATATACCTAAAACTTTTAA.TGTGGGATATTTTGTATCAGTGCTTTGATTCACAATT 
TTCAAGAGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATT 
ATTAGTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTAC 
TGAGCTAACCACTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAAC 
TTCATTTGTCATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGA 
AGAGTGAATGAGTTTCTCCCACTCa?ATACTAATCTCACTATTTGTATTGAGCCCAAAATAAC 
TATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTT 
ATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT 
CAGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCATT 
TATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAATGCTGAAAGATTA 
CCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAGACTGG 
GTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAAGGAAGGTAGCCGGGCA 
TGGTGCCAG6CACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGAGATTATGCC 
ATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45419 
xsubunit 1 of 1, 373 aa, 1 stop 
xMW: 41281, pi: 8.33, N2:(S/T): 3 

MSLLLLLLLVSYYVGTLGTHTEIKIIVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKV 
VITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHV 
ILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRID 
YNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCWRVTVQYVQSIGMVAGAVTGIVAGALLI 
FLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANS 
ASRSQRTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

amino acids 232-251 



BTGURE 25 



GTCGTTCCTTTGCTCTCTCGCGCCCAGTCCTCCTCCCTGGTTCTCCTCAGCCGCTGTCGGAGGAGAGCACCCGGA 
GACGCGGGOTGCAGTCGCGGCGGCTTCTCCCCGCCTGGGCGGCCTCGCCGCTGGGCAGGTGCTGAGCGCCCCTAG 
AGCCTCCCTTGCCGCCTCCCTCCTCTGCCCGGCCGCAGCAGTGCACATGGGGTGTTGGAGGTAGATGGGCTCCCG 
GCCCGGGAGGCGGCGGTGGATGCGGC6CTGGGCAGAAGCAGCCGCCGATTCCAGCTGCCCCGCGCGCCCCGGGCG 
CCCCTGCGAGTCCCCGGTTCAGCCMGGGGi^CCTCTCCGAGCAGCAGCACCGCCCTCGCOTCCTGCAGCCGCATC 
GCCCGCCGAGCCACAGCCACGATGATCGCGISGCTCCCTTCTCCTGCTTGGATTCOTTAGCACCACCACAGCTCAG 
CCAGAAOVGAAGGCCTCGAATCTCATTGGaiiCATACCGCCATGTTGACCGTGCCACCGGCCAGGTGCTAACCTGT 
GAC^AGTGTCCAGCaiGGAACCTATGTCTCTSAGCATTGTACCAACACAAGCCTGCGCGTCT^ 
GTGGGGACCTTTACCAGGCATGAGAATGGC!VTAGAGAAATGCCATGACTGTAGTCAGCCATGCCCATGGCCAATG 
ATTGAGAAATTACCTTGTGCTGCCTTGACTC3ACCGAGAATGCACTTGCCCACCTGGCATGTTCCAGTCTAACGCT 
ACCTGTGCCCCCCATACGGTGTGTCCTGTaSGTTGGGGTGTGCGGAAGAAAGGGACAGAGACTGAGGATGTGCGG 
TGTAAGCAGTGTGCTCGGGGTACCTTCTCAC3ATGTGCCTTCTAGTGTGATGAAATGCAAAGCATACACAGACTGT 
CTGAGTCAGAACCTGGTGGTGATCAAGCCGGGGACCAAGGAGACAGACAACGTCTGTGGCACACTCCCGTCCTTC 
TCCAGCTCCACCTCACCTTCCCCTGGCACAGCCATCTTTCCACGCCCTGAGCACATGGAAACCCATGAAGTCCCT 
TCCTCCACTTATGTTCCCa^GGCATGAACTCAACAGAATCCAACTCTTCTGCCTCTGTTAGACCAAAGGTACTG 
AGTAGCATCCAG6AAGGGACAGTCCCTQAC3iACa.CAAGCTCAGCy^GGGGQAAGGaAGACGTGAACAAGAC^^ 
COWlCCTTCaGGTAGTCAACCACCAGCAACMCCCCCACCACAGACACATCCTGAAGCTGCTGCCGTCCAT^ 
GCCACTGGGGGCGAGAAGTCCAGCACGCCa\,TCAAGGGCCCCaAGAGGGGACATCCTAGACAGAACCTACACAAG 
CATTTTGACATCAATGAGCATTTGCCCTGGilTGATTGTGCTTTTCCTGCTGCTGGTGCTTGTGGTGATTGTGGTG 
TGCAGTATCCGGAA2^GCTCGAGGACTCTGi\AAAAGGGGCCCCGGCAGGATCCCAGTGCCATTGTGGaAAAGGCA 
GGGCTGAAGAAATCCATGACTCCAACCCAGZACCGGGAGAAATGGATCTACTACTGCAATGGCCATGGTATCGAT 
ATCCTGAAGCrTGTAG<:akGCCCAAGTGGG?^LGCCAGTGGAAAGATATCTATCAGTTTCTTTGCAATGCCAGTGAG 
AGGQAGGTTGCTGOTTTCTCCAATGGGTAC?LCAGCCGACCACGAGCGGGCCTACGCAGCTCTGCAGCACTGGACC 
ATCCGGGGCCCCGAGGCCAGCCTCGCCCAGCTAATTAGCGCCCTGCGCCAGCACCGGAGAAACQATGTTGTGGAG 
AAGATTCGTGGGCTGATGGAAGACACCACCCIAGCTGGAAACTGACAAACTAGCTCTCCCGATGAGCCCCAGCCCG 
CTTAGCCCGAGCCCCATCCCCAGCCCCAACG5CQAAACTTGAGAATTCCGCTCTCCTGACGGTGGAGCCTTCCCCA 
CAGGACaAGAACAAGGGCTTCTTCGTGGATGIAGTCGGAGCCCCTTCTCCGCTGTGACTCTACATCCAGCGGCTCC 
TCCGCGCTGAGCAGGAACGGTTCCTTTATTACCAAAGAAAAGAAGGACACAGTGTTGCGGCAGGTACGCCTGGAC 
CCCTGTGACTTGa\GCCTATCTTTGATGAC7LTGCTCCACTTTCTAAATCCTGAGGAGCTGC^^ 
ATTCCCCAGGCTGAGGACAAACTAGACCGGCTATTCX3AAATTATTGGAGTCAAGAGCCAGGAAGCCAGCCAGACC 
CTCCTGGACTCTGTTTATAGCCATCTTCCTGiACCTGCTGTAfiAACATAGGGATACTGCATTCTGGAAATTACTCA 
ATTTAGTGGCAGGGTGGTTTTTTAATTTTdTCTGTTTCTGATTTTTGTTGTTTGGGGTGTGTGTGTGTGTTTGT 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGfTTTAACAGAGAATATGGCCAGTGCTTGAGTTCTTTCTCCTTCTC 
^S?'^J^J^y!;iT'^T^TAAATAACTCTTCTGGGAAGTTGGTTTATAAGCCTTTGCC^ 

ATACCCACCACTAAAGTTTTTTAAGTTCCATATTTTCTCCATTTTGCCTTCTTATGTATTTTCAAGATTATTCTG 
^S?^SP"^^'^'rTACTTAACTTACCATAAa,TGCAGTGTGACTTTTCCCACACaiCTGGATTGTGAG^^ 
TTCTTAAAAGTATAATGGCATCTTGTGAATCCTATAAGCAGTCTTTATGTCTCTTAACATTCACACCTACTTTTT 
^^^S^^^^'^^'^'^^^'^'^^^^'^^'^'^^'^T^^'T'^CTTTATAAATTTTCTT 

S;?i5JS™^^^^'rATTGOTGATTGTATGGTTOTCACCTGGACACCGTGTAGAATGCTTGATTA^ 

GAOiACTGGGCCACCAAAGAACTTGAACTTCACCTTTTAGGATTTGAGCTGTTCTGGAACACATTGCTGCACTTT 

GGAAAGTCAAAATCAAGTGCCAGTGGCGCCCTTTCCATAGAGAATTTGCCCAGCTTTGOTT^ 

TTTTTTATATACACATAATCaATAGGTCCa^TCTGCTCTCAAGGCCTTGGTCCTGGTGGGATTC^ 

ACTTTAATTAAAAATGGCTGCAACTGTAAGftACCCTTGTCTGATATATTTGCAACTATGCTCCCATTTACAAATG 

H^^S^I^'^^^°^^^°'^^'^^*^°^T^<^<=^"rGCAAAGGTGGCGTG^ 

GT^GGTGAAGGACCGATATCAQAAAAATGCCTTCAAGTGTACTAATTTATT 



FIGURE 26 



></usr/s eqdb2 / s s t / DNA/Dna seqs.min/ss. DNA5 2594 
xsiibiinit 1 of 1, 655 aa, 1 stop 
xMW: 71845, pi: 8.22, NX{S/T): 8 

MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPEQKASNLIGTYRHVDRATG 
QVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGIEKCHDCSQPCPWPMIEKLPCA 
ALTDRECTCPPGMFQSNATCAPHTVCPVGWGVRKKGTETEDVRCKQCARGTFSDVPSSVMKC 
KAYTDCLSQNLWIKPGTKETDNVCGTLPSFSSSTSPSPGTAIFPRPEHMETHEVPSSTYVP 
KGMNSTESNSSASVRPKVLSSIQEGWPDNTSSARGKEDVNKTLPNLQWNHQQGPHHRHIL 
KLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLPWMIVLFLLLVLWIWCSIRK 
SSRTLKKGPRQDPSAIVEKAGLKKSMTPTQNREKWIYYCNGHGIDILKLVAAQVGSQWKDIY 
QFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLISALRQHRRNDWEKIRG 
LMEDTTQLETDKLALPMSPS PLS PSPI PS PNAKLENSALLTVE PSPQDKNKGFFVDESEPLL 
RCDSTSSGSSALSRNGSFITKEKKDTVLRQVRLDPCDLQPIFDDMLHFLNPEELRVIEEIPQ 
AEDKLDRLFE I IGVKSQEASQTLLDSVYSHLPDLL 

Signal sequence: 

amino acids 1-41 

Transmembrane domain; 

amino acids 350-370 



ATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGA 
CTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCAC 
CATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 
TTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGCTGTTGCTGCACAGATCCTG 
TCACTGCTGCCATTGAAGTTTTTTC'CAATCATCGTCATTGGGATCATTGCATTGATATTAGC 
ACTGGCCATTGGTCTGGGCATCCACTTCGACTGCTCAGGGAAGTACAGATGTCGCTCATCCT 
TTAAGTGTATCGAGCTGATAGCTCGATGTGACGGAGTCTCGGATTGCAAAGACGGGGAGGAC 
GAGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCTTC 
GTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCAAC 
TGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTC 
CGGGAGGAGTTTGTGTCCATCGATGACCTCTTGCCAGATGACAAGGTGACTGCATTACACCA 
CTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAGTGCACAGCCT 
GTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAG 
TGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGGGGCTCTGTCATCAC 
GCCCCTGTGGATCATCACTGCTGCACACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGA 
CCATCCAGGTGGGTCTAGTTTCCCT(3TTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAG 
ATTGTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAATGACATCGCCCTTATGAAGCT 
GGCCGGGCCACTCACGTTCAATGAA?ITGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAGA 
ACTTCCCCGATGGAAAAGTGTGCTGC5ACGTCAGGATGGGGGGCCACAGAGGATGGAGGTGAC 
GCCTCCCCTGTCCTGAACCACGCGGCCGTCCCTTTGATTTCCAACAAGATCTGCAACCACAG 
GGACGTGTACGGTGGCATCATCTCCCCCTCCATGCTCTGCGCGGGCTACCTGACGGGTGGCG 
TGGACAGCTGCCAGGGGGACAGCGG(3GGGCCCCTGGTGTGTCAAGAGAGGAGGCTGTGGAAG 
TTAGTGGGAGCGACCAGCTTTGGCATCGGCTGCGCAGAGGTGAACAAGCCTGGGGTGTACAC 
CCGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGACCTAAAAACCTGAA 
GAGGAAGGGGACAAGTAGCCACCTGAGTTCCTGAGGTGATGAAGACAGCCCGATCCTCCCCT 
GGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTGAGTTCCGGCACCA 
GTAGCAGGCCCGAAAGAGGCACCCTTCCATCTGATTCCAGCACAACCTTCAAGCTGCTTTTT 
GTTTTTTGTTTTTTTGAGGTGGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGAAA 
TCCCTGCTCACTGCAGCCTCCGCTTCCCTGGTTCAAGCGATTCTCTTGCCTCAGCTTCCCCA 
GTAGCTGGGACCACAGGTGCCCGCCACCACACCCAACTAATTTTTGTATTTTTAGTAGAGAC 
AGGGTTTCACCATGTTGGCCAGGCTGCTCTCAAACCCCTGACCTCAAATGATGTGCCTGCTT 
CAGCCTCCCACAGTGCTGGGATTACi\GGCATGGGCCACCACGCCTAGCCTCACGCTCCTTTC 
TGATCTTCACTAAGAACAAAAGAAGCHAGCAACTTGCAAGGGCGGCCTTTCCCACTGGTCCAT 
CTGGTTTTCTCTCCAGGGTCTTGCAJSAATTCCTGACGAGATAAGCAGTTATGTGACCTCACG 
TGCAAAGCCACCAACAGCCACTCAG^^AAAGACGCACCAGCCCAGAAGTGCAGAACTGCAGTC 
ACTGCACGTTTTCATCTCTAGGGACCAGAACCAAACCCACCCTTTCTACTTCCAAGACTTAT 
TTTCACATGTGGGGAGGTTAATCTAGGAATGACTCGTTTAAGGCCTATTTTCATGATTTCTT 
TGTAGCATTTGGTGCTTGACGTATTATTGTCCTTTGATTCCAAATAATATGTTTCCTTCCCT 
CATTGTCTGGCGTGTCTGCGTGGACTGGTGACGTGAATCAAAATCATCCACTGAAA 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA45234 
xsubunit 1 of 1, 453 aa, 1 stop 
xMW: 49334, pi: 6.32, NX(S/T): 1 

MGENDPPAVEAPFSFRSLFGLDDLKISPVAPDADAVAAQILSLLPLKFFPIIVIGIIALILA 
LAIGLGIHFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAAS 
WKTMCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHH 
SVYVREGCASGHWTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVIT 
PLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMKL 
AGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGGDASPVLNHAAVPLISNKICNHR 
DVYGGIISPSMLCAGYLTGGVDSCCIGDSGGPLVCQERRLWKLVGATSFGIGCAEVNKPGVYT 
RVTSFLDWIHEQMERDLKT 

Signal Peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 240-284 



FIGURE 29 



CCCACGCGTCCGTCCTAGTCCCCGGGCCM.CTCGGACAGTTTGCTCATTTATTGCAACGGTCAAGGCTGGCTTGT 

GCCAGAACGGCGCGCGCGCGCGCACGCACGCACACACACGGGGGGAAACTTTTTTAAAAATGAAAGGCTAGAAGA 

GCTCAGCGGCGGCGCGGGCGCTGCGCGAGG^CTCCGGAGCTGACTCGCCGAGGCAGGAAATCCCTCCGGTCGCGA 

CGCCCGGCCCCGGCTCGGCGCCCGCGTGGGaiTGGTGCAGCGCTCGCCGCCGGGCCCGAGAGCTGCTGCACTGAAG 

GCCGGCGACGMGGCAGCGCGCCCGCTGCCCGTGTCCCCCGCCCGCGCCCTCCTGCTCGCCCTGGCCGGTGCTCT 

GCTCGCGCCCTGCGAGGCCCGAGGGGTGAGCTTATGGAACCAAGGAAGAGCTGATGAAGTTGTCAGTGCCTCTGT 

TCGGAGTGGGGACCTCTGGATCCCAGTGAA.GAGCTTCGACTCCAAGAATCATCCAGAAGTGCTGAATATTCGACT 

ACAACGGGAAAGCAAAGAACTGATCATAM.TCTGGAAAGAAATGAAGGTCTCATTGCCAGCAGTTTCACGGA^ 

CCACTATCTGCAAQACGGTACTGATGTCTCCCTCGCTCGAAATTACACGGGTCACTGTTACTACCATGGACATGT 

ACGGGGATATTCTGATTCAGCAGTCAGTCTCAGCACGTGTTCTGGTCTCAGGGGACTTATTGTGTTTGAAAATGA 

AAGCTATGTCTTAGAACCAATGAAAAGTGCAACCAACAGATACAAACTCTTCCCAGCGAAGAAGCTGAAAAGCGT 

CCGGGGATCATGTGGATCACATCACAACACACCAAACCTCGCTGCAAAGAATGTGTTTCCACC».CCCTCTCAG^^ 

ATGGGCAAGAAGGCATAAAAGAGAGACCCTCAAGGCAACTAAGTATGTGGAGCTGGTGATCGTGGCAGACAACCG 

AGAGTTTCaGAGGCaAGGA?^GATCTGGaAA2UlGTTAAGCAGCGATTAATAGAGATTGCTAATCACGT^^ 

GTTTTACAGACCACTGAACATTCGGATCGTGTTGGTAGGCGTGGAAGTGTGGAATGACATGGACAAATGCTCTGT 

AAGTCaCGACCCaTTCACCAGCCTCCa^TGAATTTCTGGACTGGAGGAAGATGAAGCTTCTACCTCGCTiAATCC^ 

TGACaATGCGO^GCTTGTCa^GTGGGGTTTATTTCCAAGGGACCACCATCGGCATGGCCCCaiATCATGAGC^^ 

CaCGGCAGACCTkGTCTGGGGGAATTGTCATGGACCATTCAGACAATCCCCTTGGTGCAGCCGTGACCCTGGCACA 

TGAGCTGGGCCACAATTTCGGGATGAATCATGACACACTGGACAGGGGCTGTAGCTGTCAAATGGCGGTTGAGaA 

AGGAGGCTGCATCATGAACGCTTCCACCGGGTACCCATTTCCCATGGTGTTCAGCAGTTGCAGCAGGAAGGACTT 

GGAQACCAGCCTGGAGAAAGGAATGGGGGTCTGCCTGTTTAACCTGCCGGAAGTCAGGGAGTCTTTCGGGGGCCA 

GAAGTGTGGGAACAQATTTGTGGAAGAAGGAGAGGAGTGTGACTGTGGGGAGCCAGAGGAATGTATGAATCGCTG 

CTGCAATGCCACCACCTGTACCCTGAAGCOSGACGCTGTGTGCGCACATGGGCTGTGCTGTGAAGACTGCCAGCT 

GAAGCCTGCAGGAACAGCGTGCAGGGACTCCAGCAACTCCTGTGACCTCCCAGAGTTCTGCACAGGGGCCAGCCC 

TCACTGCCCAGCCAATGTGTACCTGCACGArGGGCACTCATGTCAGGATGTGGACGGCTACTGCTACAATGGCAT 

CTGCCAGACTCACGAGCAGCAGTGTGTCACGCTCTGGGGACCAGGTGCTAAACCTGCCCCTGGGATCTGCTTTGA 

GAQAGTCaATTCTGCAGGTGATCCTTATGGiZAACTGTGGCAAAGTCTCGAAGAGTTCCTTTGCCAAATG^ 

GAGAGATGCTAAATGTGGaAAAATCCAGTGTCAAGGAGGTGCO^GCCGGCCAGTCATTGGTACCAATGCCGTTTC 

CATAGAAACAAACATCCCTCTGCAGCAAGG?iGGCCGQATTCTGTGCCGGGGGACCCACGTGTACTTGGGCGATGA 

CATGCCGGACCCAGGGCTTGTGCTTGCAGQCACAAAGTGTGCAGATGGAAAAATCTGCCTGAATCGTCAATGTCA 

AaATATTAGTGTCrTTGGGGTTCACGAGTGTGCAATGCAGTGCCa.CGGCAGAGGGGTGTGCaACAAC^ 

^IS^?^^'^^^^^'=^CCACTGGGCaCCTCCCTTCTGTGACAAGTTTGGCTTTGGAGQAAG 

CATCCGGCaAGCAGAAGCTUlGGCAGGAAGCTGCAGAGTCCAACAGGGAGCGCGGCCyiGGGCCAGGAGCCCGTGG^ 
ATCGO^GGAGCATGCGTCTACTGCCTCACTtSACACTCATCTGAGCCCTCCCATGACATGGAGACCGTGAC^^ 
S^^5^^^5^f^^^TCACGCGTCCCCAAG(3CCrCCTGTQACTGGCAGCATTGACTC^^ 
^?^^^'^^^^'^^^'^'^'^^^'^^'<^^-^<^<^^ 

t?52^^^^™C^CTCTGGCAGGCCCC:AGCCCTGCAGC:aAGGAGGAAGAGG^^ 

ACTGAGCCTCCACAGCAGTGGGGGAGAAGakAGGGTTGGGCCCAGTGTCCCCTTTCCCCAGTGACACCTCAGCCT 

^^S^S^ ^^^^^^^^^^^*^C<^TGAACTCTGGCTCTTTGCTGTGGACATGCGTGACC^^ 
TGTTAAAAAGTGAAAACAATGTAAGAGCCT^ACTCCATCCCCCGTGGCC^^ 



X /us r/ seqdb2 / s s t /DNA/Dnaseqs . inin/ s s . DNA4 9 6 24 
xsiibunit 1 of 1, 735 aa, 1 stop 
xMW: 80177, pi: 7.08, NX(S/T): 5 

MAARPLPVSPARALLLALAGALLAPCEARGVSLWNQGRADEWSASVRSGDLWIPVKSFDSK 
NHPEVLNIRLQRESKELIINLERNEGLIASSFTETHYLQDGTDVSLARHYTGHCYYHGHVRG 
YSDSAVSLSTCSGLRGLIVFENESWLEPMKSATNRYKLFPAKKLKSVRGSCGSHHNTPNLA 
AKNVFPPPSQTWARRHKRETLKATKYVELVIVADNREFQRQGKDLEKVKQRLIEIANHVDKF 
YRPIJ^IRIVLVGVEVWITOMDKCSVSQDPFTSLHEFLDWRKMKLLPRKSHDNAQLVSGVYFQG 
TTIGMAPIMSMCTADQSGGIVMDHS:DNPLGAAVTIJmELGHNFG^^mDTLDRGCSCQMAVEK 
GGCIMNASTGYPFPMVFSSCSRKDLETSLEKGMGVCLFNLPEVRESFGGQKCGNRFVEEGEE 
CDCGEPEECMNRCCNATTCTLKPDAVCAHGLCCEDCQLKPAGTACRDSSNSCDLPEFCTGAS 
PHCPANVYLHDGHSCQDVDGYCYNGICQTHEQQCVTLWGPGAKPAPGICFERVNSAGDPYGN 
CGKVSKSSFAKCEMRDAKCGKI QCQGGASRPVIGTNAVS lETNI PLQQGGRI LCRGTHVYLG 
DDMPDPGLVIAGTKCADGKICIJmQCQNISVFGVHECAMQCHGRGVCNNRKNCHCEAHWAPP 
FCDKFGFGGSTDSGPIRQAEARQEASlESNRERGQGQEPVGSQEHASTASLTLI 



Signal peptide: 

amino acids 1-28 



g'lGURE 31 

TCCCAAGGCTTCTTGGATGGCAGATGATTNTGGGGTTTTGCATTGTTTCCCTGACAACGAAA 
ACAAAACAGTTTTGGGGGTTCAGGAGGGGAANTCCAGCCTACCCAGGAAGTTTGCAGAAACA 
GTGCAAGGAAGGGCAGGANTTCCTGGTTGAGNTTTTTGNTAAAACATGGACATGNTTCAGTG 
CTGCTCNTGAGAGAGTAGCAGGTTACCACTTTTGGCAGGCCCCAGCCCTGCAGCAAGGAGGA 
AGAGGACTCAAAAGTTTGGCCTTTCACTGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTT 
GGGCCCA6TGTCCCCTTTCCCCAGTGACACCTCAGCCTTGGCAGCCCTGATAACTGGTNTNT 
GGCTGCAANTTAATGCTNTGATATGGCTTTTAGCATTTATTATATGAAAATAGCAGGGTTTT 
AGTTTTTAATTTATCAGAGACCCTGGCACCCATTCCATNTCCATCCAAG 



li'IGURE 32 



CATCCTGCAACATGGTGAAACCACGtCCTGGCTAATTTTGTTGTATTTTTGGTAGAGATGGGA 
TTTCACCGTGTTAGCCAGGATTGTGTCAATCTGACCTCATGATCTGCCCGCCTCGGCCTCCC 
AAAGTGCTGGGATTACAGGCGAGTGCAACCACACCCGGCCACAAACTTTTTAAGAAGTTAAT 
GAAACCATACCTTTTACATTTTTAJ^TGACAGGAAAATGCTCACAATAATTGTTAACCCAAAA 
TTCTGGATACAAAAGTACAATCTTTACTGTGTAAATACATGTATATGTACTATATGAAAATA 
TACCAAATATCAATAATACTTATCTCTGGGTAAAAACCTCTTCTCATACCCTGTGCTAACAA 
CTTTTAACAAAAAATTTGCATCACTTTTAAGAATCAAGAAAAATTTCTGAAGGTCATATGGG 
ACAGAAAAAAAAACCAAGGGAAAAATCACGCCACTTGGGAAAAAAAGATTCGAAATCTGCCT 
TTTTATAGATTTGTAATTAATAAGGTCCAGGCTTTCTAAGCAACTTAAATGTTTTGTTTCGA 
AACAAAGTACTTGTCTGGATGTAGGAGGAAAGGGAGTGATGTCACTGCCATTATGATGCCCC 
TTGAATATAAGACCCTACTTGCTATCTCCCCTGCACCAGCCAGGAGCCACCCATCCTCCAGC 
ACACTGAGCAGCAAGCTGGACACACGGCACACTGATCCAAATGGGTAAGGGGATGGTGGCGA 
TGCTCATTCTGGGTCT6CTACTTCTGGCGCTGCTCCTACCCGTGCAGGTTTCTTCATTTGTT 
CCTTTAACCAGTATGCCGGAAGCTACTGCAGCCGAAACCACAAAGCCCTCCAACAGTGCCCT 
ACAGCCTACAGCCGGTCTCCTTGTGGTCTTGCTTGCCCTTCTACATCTCTACCA TTAAG AGG 
CAGGTCAAGAAACAGCTACAGTTCTCCAACCCATACACTAAAACCGAATCCAAATGGTGCCT 
AGAAGTTCAATGTGGCAAGGAAAAAAACCAGGTCTTCATCAAATCTACTAATTTCACTCCTT 
ATTAACAGAGAAACGCTTQAGAGTCTCAAACTGGACTGGTTTAAAGAGCATCTGAAGGATTT 
GACTAGATGATAAATGCCTGTACTCCCAGTACTTTGGGAGGCCTAGGCCGGCGGATCACCTG 
AGGTCAGGAGTTTGAGACTAACCTGGCCAAAATGGTGAAACCCCATCTGTACTAAAAATACA 
AATATTGACTGGGCGTGGTGGTGAGG^GCCTGTGATCCCAGCTACTCAGGTGGCTGAAGCAGG 
ACAATCACTTGAACTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCGCGCTACTGCACTCTA 
GCCTAGCCTGGGCAACAGAGTGAGACTTCGTCTCAAAAAAAAAAAAGCCAAGTGCAGTGGCT 
CACGCCTGTAATCCCGGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGATCA 
AGACCATCCTGGCTAATACAGTGAAi^CCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGG 
6ATGGTGGCAGGCACCTGGAGTCCCAGCTACTCGQGAGGCTGAGGCAGGAGAATAGCGTGAA 
CTCAGGAGGCGGAGCTTGCAGTGAGGCGAGATTGCGCTACTGCACTCCAGCCTGGGCGACAG 
CGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAA 



></usr/ segdb2/sst/DNA/Dnaseqs .min/ss.DNA48309 
xsubunit 1 of 1, 67 aa, 1 stop 
xMW: 6981, pi: 7.47, NX(S/T): 0 

MGKGMVAMLILGLLLLALLLPVQVSSFVPLTSMPEATAAETTKPSNSALQPTAGLLWLLAL 
LHLYH 

Signal peptide: 
amino acids 15-27 



GCCGCGGCGA.GAGCGCGCCCAGCCCCGCCGCGATGCCCGCGCGCCCAGGACGCCTCCTCCCGCTGCTGGCCCGGC 

CGGCGGCCCTGACTGCGCTGCTGCTGCTGCTGCTGGGCCATGGCGGCGGCGGGCGCTGGGGCGCCCGGGCCCAGG 

AGGCGGCGGCGGCGGCGGCGGACGGGCCCCCCGCGGCAGACGGCGAGGACGGACAGGACCCGCACAGCI^GCACC 

TGTACACGGCCGACATGTTCACGCACGGGATCCAGAGCGCCGCGCACTTCGTCATGTTCTTCGCGCCCTGGTGTC 

GACACTGCCAGCGGCTGCAGCCGACTTGGAATGACCTGGGAGACAAATACAACAGCATGGAAGATGCC^^ 

ATGTGGCTAAAGTGGACTGCACGGCCCACTCCGACGTGT6CTCCGCCCAGGGGGTGCGAGGATACCCCACCTTAA 

AGCTTTTOUlGCCAGGCCAAGAAGCTGTGAiGTACCAGGGTCCTCGGGACTTCCaGACACTGGAAAACTGGATGC 

TGCAGACACTGAACGAGGAGCa^GTGACACCAGAGCCGQaAGTGGAACCGCCCAGTGCCCCCGAGCTCAAGC^ 

GGCTGTATQAGCTCTCAGCaAGCAACTTTGAGCTGCACGTTGCACAAGGCGACCaCTTTATa^GTTCTTC^ 

CGTGGTGTGGTCACTGCAAAGCCCTGGCTCiOAACCTGGGAGCAGCTGGCTCTGGGCCTTGAACATTCCGAAACTG 

TCAAGATTGGOlAGGTTGATTGTACACAGailCTATGAACTCTGCTCCGGAAACCaGGTTCGTGGCTATCCC^^ 

TTCTCTGGTTCCGAGATGGGAAAAAGGTGGJOlTCAGTACAAGGGAAAGCGGGATTTGGAGTCACTGAGGGAGTACG 

TGGAGTCGCAGCTGCAGCGCACAGAGACTGSAGCGACGGAGACCGTCACGCCCTCAGAGGCCCCGGTGCTGGCAG 

CTGAGCCCGAGGCTGACaAGGGC&CTGTGTTGGCa^CTCACTGAAAATAACTTCGATQACTlCCATTGC^ 

TAACCTTCaTCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTGGGAGGAACTCTCTA 

AAAAGGAATTCCCTGGTCTGGCGGGGGTCAiilGATCGCCGAAGTAGACTGCACTGCTGaACGGAATATCTGCAGCA 

AGTATTCGGTACGAGGCTACCCOVCGTTATTGCTTTTCCGAGGAGGGAAGAAAGTCaGTGAGCACAGTGGAGGCA 

GAGACCTTGACTCGTTACACCGCTTTGTCCTGAGCCAAGCGAAAGACGaACTTTAGGAACACAGTTGGAGGTC^^ 

CTCTCCTGCCCAGCTCCCGCACCCTGCGTTTAGGAGTTCAGTCCCACAGAGGCCACTGGGTTCCCAGTGGTGGCT 

GTTCaGAAAGCAGAACATACTAAGCGTGAa3TATCTTCTTTGTGTGTGTGTrTTCCAAGCCAACACACTCTACA.G 

ATTCTTTATTAAGTTAAGTTTCTCTAAGTA2\ATGTGTAACTCATGGTCACTGTGTAAACaTTTTCAGTGGCGATA 

TATCCCCTTTGACCTTCTCTTGATGAAATTOACATGGTTTCCTTTGAGACTAAAATAGCGTTGAGGGAAATGAAA 

TTGCTGGACTATTTGTGGCTCCTGAGTTGAGTGATTTTGGTGAAAGAAAGCACATCCAAAGCATAGTTTACCTGC 

CCACGAGTTCTGGAAAGGTGGCCTTGTGGasLGTATTGACGTTCCTCTGATCTTAAGGTCaCAGTTGACTCAATAC 

TGTGTTGGTCCGTAGCATGGAGCAGATTGAiWlTGCAAAAACCCACACCTCTGGAAGATACCTTCACGGCCGCTGC 

TGQAGCTTCTGTTGCTGTGAATACTTCTCTCAGTGTGAGAGGTTAGCCGTGATGAAAGCAGCGTTACTTCTGACC 

GTGCCTGAGTAAGAGAATGCTGATGCCATA?ICTTTATGTGTCGATACTTGTCAAATCAGTTACTGTTCAGGGGAT 

CCTTCTGTTTCTCACGGGGTGAAACATGTCTTTAGTTCCTCATGTTAACaCGAAGCCAQAGCCCACATGAACTGT 

TGGATGTCTTCCTTAGAAAGGGTAGGCATaSAAAATTCCACQAGGCTCATTCTCAGTATCTCATTAACTCATTGA 

AAGATTCa^GTTGTATTTGTCACCTGGGGT<3Aa^GACCAGACAGGCTTTCCCAGGCCTGGGTATCCAGGGAGGC 

TCTGCAGCCCTGCTGAAGGGCCCTAACTAG?iGTTCTAGAGTTTCTGATTCTGTTTCTCAGTAGTCCTTTTAGAGG 

CTTGCTATACTTGGTCTGCTTCAAGGAGGTCGACCTTCTAATGTATGAAGAATGGGATGCATTTGATCTCAAGAC 

CAAAGACAGATGTCAGTGGGCTGCTCTGGCCCTGGTGTGCACGGCTGTGGCAGCTGTTGATGCCAGTGTCCTCTA 

ACTCATGCTGTCCTTGTGATTAAACACCTCTATCTCCCTTGGGAATAAGCACATACAGGCTTAAGCTCTAAGATA 

GATAGGTGTTTGTCCTTTTACCATCGAGCTACTTCCCATAATAACCACTTTGCATCC:»^CaCTCTTCACCCft.CCT 

CCO^TACGCAAGGGGATGTGGATACTTGGCCCAAAGTAACTGGTGGTAGGAATCTTAGAAACAAGACCACTTATA 

CTGTCTGTCTGAGGCAGAAGATAACAGCAGC!ATCTCGACCAGCCTCTGCCTTAAAGGAAATCTTTATTAATCACG 

TATGGTTCACaGATAATTCTTTTTTTAAAMiAACCCAACCTCCTAGAGAAGCACAACTGTCAAGAGTCTTGTACA 

CACTACTTCAGCTTTGCATCACGAGTCTTGTArFCCAAGAAAATCS^GTGGTACy^T^^ 



FIGURE 35 

>< /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA4 6 7 7 6 
xsubunit 1 of 1, 432 aa, 1 stop 
xMW: 47629, pi: 5.90, NX (S/T) : 0 

MPARPGRLLPLLARPAALTALLLLLLGHGGGGRWGARAQEAAAAAADGPPAADGEDGQDPHS 

KHLYTimWFTHGIQSAMIFVMFFAPWCGHCQRLQPTlimDLGDKYNSMEDAKVYVAK^ 

SDVCSAQGVRGYPTLKLFKPGQEAV]^QGPRDFQTLENWMLQTLNEEPVTPEPEVEPPSAPE 

LKQGLYELSASNFELHVAQGDHFIKFFAPWCGHCKALAPTWEQLALGLEHSETVKIGKVDCT 

QHYELCSGNQVRGYPTLLWFRDGKK\nDQYKGKRDLESLREYVESQLQRTETGATETVTPSEA 

PVIJ^PEADKGTVLALTENNFDDTIAEGITFIKFYAPWCGHCKTLAPTWEELSKKEFPGLA 

GVKIAEVDCTAERNICSKYSVRGYPTLLLFRGGKKVSEHSGGRDLDSLHRFVLSQAKDEL 

Signal sequence: 

amino acids 1-32 



CTTTTCTGAGGAACCACAGCAATGA?lTGGCTTTGCATCCTTGCTTCGAAGA2yiCCAATTTAT 
CCTCCTGGTACTATTTCTTTTGCAAATTCAGAGTCTGGGTCTGGATATTGATAGCCGTCCTA 
CCGCTGAAGTCTGTGCCACACACAC.AATTTCACCAGGACCCAAAGGAGATGATGGTGAAAAA 
GGAGATCCAGGAGAAGAGGGAAAGCATGGCAAAGTGGGACGCATGGGGCCGAAAGGAATTAA 
AGGAGAACTGGGTGATATGGGAGATCAGGGCAATATTGGCAAGACTGGGCCCATTGGGAAGA 
AGGGTGACAAAGGGGAAAAAGGTTTGCTTGGAATACCTGGAGAAAAAGGCAAAGCAGGTACT 
GTCTGTGATTGTGGAAGATACCGGA?IATTTGTTGGACAACTGGATATTAGTATTGCTCGGCT 

caagacatctatgaagtttgtcaag;\atgtgatagcagggattagggaaactgaagagaaat 

TCTACTACATCGTGCAGGAAGAGAAGAACTACAGGGAATCCCTAACCCACTGCAGGATTCGG 

ggtggaatgctagccatgcccaaggatgaagctgccaacacactcatcgctgactatgttgc 

CAAGAGTGGCTTCTTTCGGGTGTTCATTGGCGTGAATGACCTTGAAAGGGAGGGACAGTACA 

tgtccacagacaacactccactgcagaactatagcaactggaatgagggggaacccagcgac 

CCCTATGGTCATGAGGACTGTGTGGAGATGCTGAGCTCTGGCAGATGGAATGACACAGAGTG 

ccatcttaccatgtactttgtctgt(;agttcatcaagaagaaaaagtaacttccctcatcct 

ACGTATTTGCTATTTTCCTGTGACCGTCATTACAGTTATTGTTATCCATCCTTTTTTTCCTG 
ATTGTACTACATTTGATCTGAGTCAJIiCATAGCTAGAAAATGCTAAACTGAGGTATGGAGCCT 
CCATCATCAAAAAAAAAAAAAAAA 



E^IGURE 37 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ ss .DNA50980 
xsubunit 1 of 1, 277 aa, 1 stop 
xMW: 30645, pi: 7.47, NX(S/T): 2 

MNGFASLLRRNQFILLVLFLLQIQSLGLDIDSRPTAEVCATHTISPGPKGDDGEKGDPGEEG 
KHGKVGRMGPKGIKGELGDMGDQGNIGKTGPIGKKGDKGEKGLLGIPGEKGKAGTVCDCGRY 
RKFVGQLDISIARLKTSMKFVKNVIAGIRETEEKFYYIVQEEKNYRESLTHCRIRGGMLAMP 
KDEAANTLIADYVAKSGFFRVFIGWDLEREGQYMSTDNTPLQNYSNWNEGEPSDPYGHEDC 

vemlssgrwndtechltmyfvcefi:eckkk 

Signal peptide: 

amino acids 1-25 



l^IGURE 38 



ggttctatcgattcgaattcggccacactggccggatcctctagagatccctcgacctcgac 
ccacgcgtccgctgctctccgcccgtgtggagtggtgggggcctgggtggga atgg gcgtgt 
gccagcgcacgcgcgctccctggaaggagaagtctcagctagaacgagcggccctaggtttt 
cggaagggaggatcagggatgttt(3cgagcggctggaaccagacggtgcc6atagaggaagc 
gggctccatggctgccctcctgctgctgcccctgctgctgttgctaccgctgctgctgctga 
agctacacctctggccgcagttgcgctggcttccggcggacttggcctttgcggtgcgagct 
ctgtgctgcaaaagggctcttcgagctcgcgccctggccgcggctgccgccgacccggaagg 
tcccgaggggggctgcagcctggcc:tggcgcctcgcggaactggcccagcagcgcgccgcgc 
acacctttctcattcacggctcgcggcgctttagctactcagaggcggagcgcgagagtaac 
agggctgcacgcgccttcctacgtgcgctaggctgggactggggacccgacggcggcgacag 
cggcgaggggagcgctggagaaggcgagcgggcagcgccgggagccggagatgcagcggccg 
gaagcggcgcggagtttgccggaggkjgacggtgccgccagaggtggaggagccgccgcccct 
ctgtcacctggagcaactgtggcggtgctcctccccgctggcccagagtttctgtggctctg 
gttcgggctggccaaggccggcctgcgcactgcctttgtgcccaccgccctgcgccggggcc 
ccctgctgcactgcctccgcagctgcggcgcgcgcgcgctggtgctggcgccagagtttctg 
gagtccctggagccggacctgcccgccctgagagccatggggctccacctgtgggctgcagg 
cccaggaacccaccctgctggaattagcgatttgctggctgaagtgtccgctgaagtggatg 
ggccagtgccaggatacctctcttccccccagagcataacagacacgtgcctgtacatcttc 
acctctggcaccacgggcctccccaaggctgctcggatcagtcatctgaagatcctgcaatg 
ccagggcttctatcagctgtgtggtgtccaccaggaagatgtgatctacctcgccctcccac 
tctaccacatgtccggttccctgctgggcatcgtgggctgcatgggcattggggccacagtg 
gtgctgaaatccaagttctcggctggtcagttctgggiagattgccagcagcacagggtgac 
ggtgttccagtacattggggagctgtgccgataccttgtcaaccagcccccgagcaaggcag 
aacgtggccataaggtccggctggciagtgggcagcgggctgcgcccagatacctgggagcgt 
tttgtgcggcgcttcgggcccctgciaggtgctggagacatatggactgacagagggcaacgt 
ggccaccatcaactacacaggacagcggggcgctgtggggcgtgcttcctggctttacaagc 
atatcttccccttctccttgattcgctatgatgtcaccacaggagagccaattcgggacccc 
caggggcactgtatggccacatctcicaggtgagccagggctgctggtggccccggtaagcca 
gcagtccccattcctgggctatgctggcgggccagagctggcccaggggaagttgctaaagg 
atgtcttccggcctggggatgttttcttcaacactggggacctgctggtctgcgatgaccaa 
ggttttctccgcttccatgatcgtactggagacaccttcaggtggaagggggagaatgtggc 
cacaaccgaggtggcagaggtcttcgaggccctagattttcttcaggaggtgaacgtctatg 

GAGTCACTGTGCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCCTAGTTCTGCGTCCCCCC 

cacgctttggaccttatgcagctctacacccacgtgtctgagaacttgccaccttatgcccg 
gccccgattcctcaggctccaggagtctttggccaccacagagaccttcaaacagcagaaag 

TTCGGATGGCAAATGAGGGCTTCGACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGAC 

caggctgtaggtgcctacctgcccctcacaactgcccggtacagcgccctcctggcaggaaa 
ccttcgaatctgagaacttccacacctgaggcacctgagagaggaactctgtggggtggggg 
ccgttgcaggtgtactgggctgtcagggatcttttctataccagaactgcggtcactatttt 
gtaataaatgtggctggagctgatccagctgtctctgacctaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaagggcggccgcgactctagagtcgacctgcagtagggataacagggtaataagc 
ttggccgccatggcccaacttgtttattgcag 



E'IGURE 3 9 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50913 
xsubunit 1 of 1, 73 0 aa, 1 stop 
xMW: 78644, pi: 7.65, NX(S/T): 2 

MGVCQRTRAPWKEKSQLERAALGFRKGGSGMFASGWNQTVPIEEAGSMAALLLLPLLLLLPL 
LLLKLHLWPQLRWLPADIAFAVRALCCKRALRARALAAAAADPEGPEGGCSLAWRLAELAQQ 
RAAHTFLIHGSRRFSYSEAERESNRAARAFIiRALGWDWGPDGGDSGEGSAGEGERAAPGAGD 
AAAGSGAEFAGGDGAARGGGAAAPLSPGATVALLLPAGPEFLWLWFGLAKAGLRTAFVPTAL 
RRGPLLHCLRSCGARALVLAPEFLESLEPDLPALRAMGLHLWAAGPGTHPAGISDLLAEVSA 
EVDGPVPGYLSSPQSITDTCLYIFTSGTTGLPKAARISHLKILQCQGFYQLCGVHQEDVIYL 
ALPLYHMSGSLLGIVGCMGIGATWLKSKFSAGQFWEDCQQHRVTVFQYIGELCRYLVNQPP 
SKAERGHKVRLAVGSGLRPDTWERFVRRFGPLQVLETYGLTEGNVATINYTGQRGAVGRASW 
LYKHIFPFSLIRYDVTTGEPIRDPQGHCMATSPGEPGLLVAPVSQQSPFLGYAGGPELAQGK 
LLKDVFRPGDVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGENVATTEVAEVFEALDFLQEV 
NVYGVTVPGHEGRAGMAALVLRPPHALDLMQLYTHVSENLPPYARPRFLRLQESIATTETFK 
QQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARYSALLAGNLRI 

Type II transmembrane domain: 

amino acids 45-65 

Other transmembrane domain: 

amino acids 379-398 

CAMP- and cGMP -dependent protein kinase phosphorylation site 

starting at amino acid 136 

CUB domain protein motif 

amino acids 254-261 

putative AMP -binding domain siganture 

amino acids 332-343 

N-glycosylation sites 

amino acids 37-40 and 483-486 



nnHTHTim 



FIGURE 40 



CCTGTGTTAAGCTGAGGTTTCCCCTAGATCTCGTATATCCCCAACACATACCTCCACGCACA 
CACATCCCCAAGAACCTCGAGCTCACACCAACAGACACACGCGCGCATACACACTCGCTCTC 
GCTTGTCCATCTCCCTCCCGGGGGAGCCGGCGCGCGCTCCCACCTTTGCCGCACACTCCGGC 
GAGCCGAGCCCGCAGCGCTCCAGGATTCTGCGGCTCGGAACTCGGATTGCAGCTCTGAACCC 
CCATGGTGGTTTTTTAAACACTTCTTTTCCTTCTCTTCCTCGTTTTGATTGCACCGTTTCCA 
TCTGGGGGCTAGAGGAGCAAGGCAGCAGCCTTCCCAGCCAGCCCTTGTTGGCTTGCCATCGT 
CCATCTGGCTTATAAAAGTTTGCTGAGCGCAGTCCAGAGGGCTGCGCTGCTCGTCCCCTCGG 
CTGGCAGAAGGGGGTGACGCTGGGCAGCGGCGAGGAGCGCGCCGCTGCCTCTGGCGGGCTTT 
CGGCTTGAGGGGCAAGGTGAAGAGCGCACCGGCCGTGGGGTTTACCGAGCTGGATTTGTATG 
TTGCACCATGCCTTCTTGGATCGGGGCTGTGATTCTTCCCCTCTTGGGGCTGCTGCTCTCCC 
TCCCCGCCGGGGCGGATGTGAAGGCTCGGAGCTGCGGAGAGGTCCGCCAGGCGTACGGTGCC 
AAGGGATTCAGCCTGGCGGACATCCCCTACCAGGAGATCGCAGGGGAACACTTAAGAATCTG 
TCCTCAGGAATATACATGCTGCACCACAGAAATGGAAGACAAGTTAAGCCAACAAAGCAAAC 
TCGAATTTGAAAACCTTGTGGAAGAGACAAGCCATTTTGTGCGCACCACTTTTGTGTCCAGG 
CATAAGAAATTTGACGAATTTTTCCGAGAGCTCCTGGAGAATGCAGAAAAGTCACTAAATGA 
TATGTTTGTACGGACCTATGGCATGCTGTACATGCAGAATTCAGAAGTCTTCCAGGACCTCT 
TCACAGAGCTGAAAAGGTACTACACTGGGGGTAATGTGAATCTGGAGGAAATGCTCAATGAC 
TTTTGGGCTCGGCTCCTGGAACGGATGTTTCAGCTGATAAACCCTCAGTATCACTTCAGTGA 
AGACTACCTGGAATGTGTGAGCAAATACACTGACCAGCTCAAGCCATTTGGAGACGTGCCCC 
GGAAACTGAAGATTCAGGTTACCCGCGCCTTCATTGCTGCCAGGACCTTTGTCCAGGGGCTG 
ACTGTGGGCAGAGAAGTTGCAAACCGAGTTTCCAAGGTCAGCCCAACCCCAGGGTGTATCCG 
TGCCCTCATGAAGATGCTGTACTGCCCATACTGTCGGGGGCTTCCCACTGTGAGGCCCTGCA 
ACAACTACTGTCTCAACGTCATGAACSGGCTGCTTGGCAAATCAGGCTGACCTCGACACAGAG 
TGGAATCTGTTTATAGATGCAATGCTCTTGGTGGCAGAGCGACTGGAGGGGCCATTCAACAT 
TGAGTCGGTCATGGACCCGATAGATGTCAAGATTTCTGAAGCCATTATGAACATGCAAGAAA 
ACAGCATGCAGGTGTCTGCAAAGGTCTTTCAGGGATGTGGTCAGCCCAAACCTGCTCCAGCC 
CTCAGATCTGCCCGCTCAGCTCCTGi\AAATTTTAATACACGTTTCAGGCCCTACAATCCTGA 
GGAAAGACCAACAACTGCTGCAGGCACAAGCTTGGACCGGCTGGTCACAGACATAAAAGAGA 
AATTGAAGCTCTCTAAAAAGGTCTGGTCAGCATTACCCTACACTATCTGCAAGGACGAGAGC 
GTGACAGCGGGCACGTCCAACGAGGAGGAATGCTGGAACGGGCACAGCAAAGCCAGATACTT 
GCCTGAGATCATGAATGATGGGCTCACCAACCAGATCAACAATCCCGAGGTGGATGTGGACA 
TCACTCGGCCTGACACTTTCATCAGJUCAGCAGATTATGGCTCTCCGTGTGATGACCAACAAA 
CTAAAAAACGCCTACAATGGCAATGATGTC2\ATTTCCAGGACACAAGTGATGAATCCAGTGG 
CTCAGGGAGTGGCAGTGGGTGCATGGATGACGTGTGTCCCACGGAGTTTGAGTTTGTCACCA 
CAGAGGCCCCCGCAGTGGATCCCGACCGGAGAGAGGTGGACTCTTCTGCAGCCCAGCGTGGC 
CACTCCCTGCTCTCCTGGTCTCTCACCTGCATTGTCCTGGCACTGCAGAGACTGTGCAGAI& 
ATCTTGGGTTTTTGGTCAGATGAAACTGCATTTTAGCTATCTGAATGGCCAACTCACTTCTT 
TTCTTACACTCTTGGACAATGGACCATGCCACAAAAACTTACCGTTTTCTATGAGAAGAGAG 
CAGTAATGCAATCTGCCTCCCTTTTTGTTTTCCCAAAGAGTACCGGGTGCCAGACTGAACTG 
CTTCCTCTTTCCTTCAGCTATCTGTGGGGACCTTGTTTATTCTAGAGAGAATTCTTACTCAA 
ATTTTTCGTACCAGGAGATTTTCTTACCTTCATTTGCTTTTATGCTGCAGAAGTAAAGGAAT 



FIGURE 41 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50914 
xsxibunit 1 of 1, 555 aa, 1 stop 
xMW: 62736, pi: 5.36, NX(S/T): 0 

MPSWIGAVILPLLGLLLSLPAGADVKARSCGEVRQAYGAKGFSLADIPYQEIAGEHLRICPQ 
EYTCCTTEMEDKLSQQSKLEFENLVI5ETSHFVRTTFVSRHKKFDEFFRELLENAEKSLNDMF 
VRTYGMLYMQNSEVFQDLFTELKRYirTGGNVNLEEMLNDFWARLLERMFQLINPQYHFSEDY 
LECVSKYTDQLKPFGDVPRKLKIQVTRAFIAARTFVQGLTVGREVANRVSKVSPTPGCIRAL 
MmLYCPYCRGLPTWPCNlSrZCLKrWIKGCIaANQADLDTEWNLFIDAMLLVAERLEGPFNIE^ 
VMDP IDVKI SEAIMNMQENSMQVSAICVFQGCGQPKPAPALRSARSAPENFNTRFRPYNPEER 
PTTAAGTSLDRLVTDIKEKLKLSKKIWSALPYTICKDESVTAGTSNEEECWNGHSK2UIYLPE 
IMNDGLTNQINNPEVDVD ITRPDTF I RQQIMALRVMTNKLKNAYNGLTDVNFQDTSDES SGSG 
SGSGCMDDVCPTEFEFVTTEAPAVDPDRREVDSSAAQRGHSLLSWSLTCIVLALQRLCR 



Signal peptide: 

amino acids 1-23 



FIGURE 42A 

CGGACGCGTGGGCGGACGCGTGGGCaAAAGiAACTCGGAGTGCCAAAGCTAAATAAGTTAGCTGAG^ 

CAGTTTGCAGCGCCTGCGCCGGGTGCGCCflACTACGCAAAGACCT^GCGGGCTCCGCGCGGACCGGCCGCGGGGC 

TAGGGACCCGGCTTTGGCCTTCaLGGCTCCCTAGCa,GCGGGGAAAAGGAATTGCTGCCCGGAGTTTCTGCGGAGGT 

GGAGGGAGATCAGGAAACGGCTTCTTCCTCACTTCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGG 

AAAGGACTGGGGAAZ^TAGCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTG 

TCTGATCAGAGCOkGACGCGACGCGTCCACTTCGOlGTTCTTTCCSVGGTGTGGGGACCGaiGGAa^GAC^ 

TCCCGCCGCCCTCCGTACCAGCACTCCCAGGAGAGTCAGCCTCGCTCCCCaACGTCGAGGGCGCTCTGGCCACGA 

AAAGTTCCTGTCCACTGTGATTCTCAATTCCTTGCTTGGTTTTTTTCTCCAGAGAACTTTTGGGTGGAGATATTA 

ACTTTTTTCTTTTTTTTTTTCCTTGGTGGAAGCTGCTCTAGGGAGGGGGGAGGAGGAGGAQAAAGTGAAATGTGC 

TGGAGAAGAGCGAGCCCTCCTTGTTCTTCCGGAGTCCCATCCATTAAGCCATCaCTTCTGGAAQATTAAAGTTGT 

CGGACATGGTGACJ^GCTGAGAGGAGaGGAGaATTTCTTGCCAGGTGGAGAGTCrTCACCGTCTGTTGGGT(^^ 

TGTGCGCCCGCAGCGGCGCGGGGCGCGTGGTTCTCCGCGTGGAGTCTCACCTGGQACCTGAGTGAATGGCTCCCA 

GGGGCTGTGCGGGGCaTCCGCCTCCGCCTTCTCCACAGGCCTGTGTCTGTCCTGGAAAGATGCTAGCAATGGGGG 

CGCTGGCAGGATTCTGGATCCTCTGCCTCCrCACTTATGGTTACCTGTCCTGGGGCCAGGCCTTAGAAGAGGAGG 

AAGAAGGGGCCTTACTAGCTCAAGCrGGAGAGAAACTAGAGCCCAGCACaACTTCCACCTCCCAGCCCCa^ 

TTTTCATCCTAGCGGATGATCAGGQATTTASAGATGTGGGTTACCACGGATCTGAGATTAAAACACCTACTCTTG 

ACAAGCTCGCTGCCGAAGGAGTTAAACTGGa.GAACTACTATGTCCAGCCTATTTGCACACCATCCAGGAGTCAGT 

TTATTACTGGAAAGTATCAGATACACACCGGACTTCAACATTCTATCATAAGACOTACCCAACCCAACTGTTTAC 

CTCTGGACSkATGCCACCCTACCTCAGAAACTGAAGGAGGTTGGATATTCAACGCATATGGTCGGAAAATGG^^ 

TGGGTTTTAACAGAAAAGAATGCATGCCOVCCAGAAGAGGATTTGATACCTTTTTTGGTTCCCTTTTGGGAAGTG 

GGGATTACTATACACACTACAAATGTGACA(3TCCTGGGATGTGTGGCTATGACTTGTATGAAAACGAC^ 

CCTGGGACTATQACaATGGCATATACTCCA'CACAGATGTACaCTCAGAGAGTACAGCAAATCTTAG^^ 

ACCCC:a.CaAAGCCTATATTTTTATATACTGCCTATCAAGCTGTTCa.TTCACCACTGCM.GCT^ 

TCGAACACTACCGATCCATTATO^CATAA/VCaGGAGAAGATATGCTGCCATGCTTTCCTGCTTAGATGAAGCA^ 

TCAACAACGTGACaTTGGCTCTAAAGACTTATGGTTTCTATAACAACAGaiTTATCATTTACTCTTaiGAT^ 

GTGGCCAGCCTACGGCAGGAGGGAGTAACTlSGCCrCTCAGAGGTAGCAAAGGAACATATTGGGAAGGAG^ 

GGGCTGTAGGCTTTGTGCATAGCCCACTTCrGAAAAACAAGGGAACAGTGTGTAAGGAACTTGTGCACATCACTG 

ACTGGTACCCCACTCT(^TTTCACTGGCTG2iAGGACAGATTGATGAGGACATTCAACTAGATGGCTATGATATC^ 

GGGAGACCaTAAGTGAGGGTCTTCGCTCACCCCGAGTAGATATTTTGCATAACATTQACCCCTATACACCAAGGC 

AAAAAATGGCTCCTGGGCAGCAGGCTATGa3ATCTGGAACaCTGCAATCCAGTCAGCCATCAGAGTGCAG(^ 

GAAATTGCTTACAGGAAATCCTGGCTACAGCGACTGGGTCCCCCCTCAGTCTTTCAGCAACCTGGGACCGAACCG 

GTGGCACAATGAACGGATCACCTTGTCAACTGGCAAaAGTGTATGGCTTTTCmACATCaCAGCC^ 

6AGGGTGGACCTATCTAAC2iGGTATCCaGG?vATCGTGAAGAAGCTCCTACGGAGGCTCTCACa^ 

TGC:SlGTGCCGGTCa.GGTATCCCCCG?iAAGAC:CCCAGAAGTAACCCTAGGCTCAATGGAGGGGTCTGGG^ 

GTATAAAGAGGAAACCAAQAAAAAGAAGCCTAGCAAAAATCAGGCTGAGAAAAAGCAAAAGAAAAGCAAAAAA^ 

GAAGAAGAAACAGCAGAAAGCAGTCTCa.GGTAAACCAGa^TTTGGCTCGATAATATCGCTGGCCT^ 

GGCTTGTTTTCATGCTGTGCCACTCCAGAG?LCTTCTGCCACCTGGCCGCCACACTGAAAACTGTCCTGCTCAGTG 

CCaAGGTGCTACTCTTGCaAGCCACACTTAGAGAGAGTGGAGATGTTTATTTCTCTCGCTCCTTTAGAAAACGTG 

GTGAGTCCTGAGTTCCACTGCTGTGCTTCAGlTCa^CTGACCAAACACTGCTTTGAATTATAGGAGGAGAAC^ 

ACCTACCATCCGCAAGCATGCTAATTTGATGGAAGTTACAGGGTAGCATGATTAAAACTACCTTTGATAAATTAC 



AGTCaAAGATTGTGTCACCTCAAAGGCCTTGAAGAATATATTTTCTTGGTGAATTTTTGTATGTCTGTCATATGA 

CACTTGGGTTTTTTAATTAATTCTATTTTATATATATAAATATATGTTTCTTTTCCTGTGAAAAGCTGTTTTTCT 

CACyiTGTQAACAGCTTGCACCTCATTTTACtllATGCGTGAGGGAATGGCaVAATAAGAATGTTT^ 

Aa^TGAATGTAACTATTTTCTAAACACTTTACTAGAAGAACaTTTCAGTATAA?iAAACCTAATTTATT^ 

GAAAAATATTTTGTTGTTTTTATAAAAAGTirATGCaAATGACTTTTATTTTTATTTCCTGO^TACCATTAGAAGA 

ATTTTATTTCATTTCTTCaAATTATCAAGC2kCTGTAATACTATAAATTAATGTAATACTGTGTGAATTCAGACTA 

TAAAAAACATCaTTCAQAAAACTTTATAATC^GTCATTGTTCaATCaAGATTTTGAATGTAATAAGA^ 

ATTACTTGGAAATTCaATGTTTGTGCAGAGTTGAGACAACTTTATTGTTTCTATCakTAAACTATTTATGTAT^ 

AATTATTAAAATGATTTACTTTATGGCACTAGAAAATTTACTGTGGCTTTTCTGATCTAACTTCTAGCTAAAATT 

GTATCATTGGTCCTAAAAAATAAAAATCTTTACTAATAGGCAATTGAAGGAATGGTTTGCTAACAACCACAGTAA 

TATAATATGATTTTACaCATACSATGCTTCCCCTTGGCTATGACATGGAGAAAGATTTTCCCATAATAATAACTAA 

TATTTATATTAGGTTGGTGCAAAACTAGTTCXIGGTTTTTCCCATTAAAAGTAATAACCTTACTCTTATACAAAGT 

GGAC:a.CTGTGGGGAGATACaGAGAAATGGAJ\.GATACGGATCCTGCOTGGAGTAGGTAACCTTGCTTGGAAACCCC 

ACATGCAAACGTCATGAGGAGAATTAAAGG?^GTATTATCAGTAATGAAGTTTATCSkTGGGTCATCaUlTGA^ 

GATTGGTGTGGATCCTGTAGACCCTGGTGTTTTCTTTGAAGTGCCCTCTCCTAATGCAGAGGCCTTGAAGCTTAC 

AGTATACACTTGAAAAGTCACAGATAGCTAGUyiTTATGATCTTTGAAGTTATAACTGTGATCTGAAAATGTGTGT 

GGTGGTATGACaGCATACCaTTAAATAa^TTTACaTCACAGCTCAAAGGACTGTGATATAATCCa 

AACTCAAAGGACTGTGATATAATCCATTTATATCAa^GCTCSVCAGTTTCTGAAAATGTATAAAAGAATCTAT 

CTAGTACTGAAATTACTA2^TTGGGTAAGATGATTTAAATGATTTTAATTTTAACATTTTATTTCTAGAATATAT 

GGCTCCa.TTTTATTTTATAGTGTAAAGTTGTATTTCCTAAAGTTTGTGTTTTGTCGACAGTATCTTTTAAATGAG 

TCTTAAAAATAAAGGCATATTGTTCATGTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAA?U^AAAAAAAAAAAAAAAMiAAAIUUiAAAAAAAZiAAAAAAAAAAAAAAAAAAAA 



FIGURE 43 

></usr/ seqdb2/sst/DNA/Dnciseqs .min/ss . DNA4 82 96 
xstjbiinit 1 of 1, 515 aa, 1 stop 
xMW: 56885, pi: 6.49, KK{S/T) : 5 

MAPRGCAGHPPPPSPQACVCPGKMLjyyiGALAGFWILCLLTYGYLSWGQALEEEEEGALLAQA 
GEKLEPSTTSTSQPHLIFILADDQGFRDVGYHGSEIKTPTLDKLAAEGVKLENYYVQPICTP 
SRSQFITGKYQIHTGLQHS I IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKEC 
MPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 
ILASHNPTKPIFLYTAYQAVHSPLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 
LKTYGFYNNSIIIYSSDNGGQPTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCK 
ELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPRVDILHNIDPYTPRQKMAPG 
QQAMGSGTLQSSQPSECSTGNCLQEILATATGSPLSLSATWDRTGGTMNGSPCQLAKVYGFS 
TSQPTHMRGWTYLTGIQES 

Important Features : 
Signal Peptide: 

amino acids 1-37 

Sulfatases signature 1. 

amino acids 120-132 

Sulfatases signature 2. 

amino acids 168-177 

Tyrosine kinase phosphorylation site. 

amino acids 163-169 

N-glycosylation sites. 

amino acids 157-160, 306-309 and 318-321 



FIGURE 44 

CGGACGCGTGGGTGCGAGTGGAGCGGAGGACCCGAGCGGCTGAGGAGAGAGGAGGCGGCGGC 
TTAGCTGCTACGGGGTCCGGCCGGCGCCCTCCCGAGGGGGGCTCAGGAGGAGGAAGGAGGAC 
CCGTGCGAGAATGCCTCTGCCCTGGAGCCTTGCGCTCCCGCTGCTGCTCTCCTGGGTGGCAG 
GTGGTTTCGGGAACGCGGCCAGTGCJLAGGCATCACGGGTTGTTAGCATCGGCACGTCAGCCT 
GGGGTCTGTCACTATGGAACTAAACTGGCCTGCTGCTACGGCTGGAGAAGAAACAGCAAGGG 
AGTCTGTGAAGCTACATGCGAACCTGGATGTAAGTTTGGTGAGTGCGTGGGACCAAACAAAT 
GCAGATGCTTTCCAGGATACACCGGCSAAAACCTGCAGTCAAGATGTGAATGAGTGTGGAATG 
AAACCCCGGCCATGCCAACACAGATGfTGTGAATACACACGGAAGCTACAAGTGCTTTTGCCT 
CAGTGGCCACATGCTCATGCCAGATGCTACGTGTGTGAACTCTAGGACATGTGCCATGATAA 
ACTGTCAGTACAGCTGTGAAGACACi^GAAGAAGGGCCACAGTGCCTGTGTCCATCCTCAGGA 
CTCCGCCTGGCCCCAAATGGAAGAGACTGTCTAGATATTGATGAATGTGCCTCTGGTAAAGT 
CATCTGTCCCTACAATCGAAGATGTGTGAACACATTTGGAAGCTACTACTGCAAATGTCACA 
TTGGTTTCGAACTGCAATATATCAGTGGACGATATGACTGTATAGATATAAATGAATGTACT 
ATGGATAGCCATACGTGCAGCCACCATGCCAATTGCTTCAATACCCAAGGGTCCTTCJiAGTG 
TAAATGCAAGCAGGGATATAAAGGCJSATGGACTTCGGTGTTCTGCTATCCCTGAAAATTCTG 
TGAAGGAAGTCCTCAGAGCACCTGGTACCATCAAAGACAGAATCAAGAAGTTGCTTGCTCAC 
AAAAACAGCATGAAAAAGAAGGCAA^^AATTAAAAATGTTACCCCAGAACCCACCAGGACTCC 
TACCCCTAAGGTGAACTTGCAGCCC1?TCAACTATGAAGAGATAGTTTCCAGAGGCGGGAACT 
CTCATGGAGGTAAAAAAGGGAATGA5.GAGAAATGAAAGAGGGGCTTGAGGATGAGAAAAGAG 
AAGAGAAAGCCCTGAAGAATGACATAGAGGAGCGAAGCCTGCGAGGAGATGTGTTTTTCCCT 
AAGGTGAATGAAGCAGGTGAATTCGGCCTGATTCTGGTCCAAAGGAAAGCGCTAACTTCCAA 
ACTGGAACATAAAGATTTAAATATCTCGGTTGACTGCAGCTTCAATCATGGGATCTGTGACT 
GGAAACAGGATAGAGAAGATGATTTTGACTGGAATCCTGCTGATCGAGATAATGCTATTGGC 
TTCTATATGGCAGTTCCGGCCTTGGCAGGTCACAAGAAAGACATTGGCCGATTGAAACTTCT 
CCTACCTGACCTGCAACCCCAAAGCJiACTTCTGTTTGCTCTTTGATTACCGGCTGGCCGGAG 
ACAAAGTCGGGAAACTTCGAGTGTTTGTGAAAAACAGTAACAATGCCCTGGCATGGGAGAAG 
ACCACGAGTGAGGATGAAAAGTGGAAGACAGGGAAAATTCAGTTGTATCAAGGAACTGATGC 
TACCAAAAGCATCATTTTTGAAGCAGJAACGTGGCAAGGGCAAAACCGGCGAAATCGCAGTGG 
ATGGCGTCTTGCTTGTTTCAGGCTTATGTCCAGATAGCCTTTTATCTGTGGATGACTGAATG 
TTACTATCTTTATATTTGACTTTGTA.TGTCAGTTCCCTGGTTTTTTTGATATTGCATCATAG 
GACCTCTGGCATTTTAGAATTACTAGCTGAAAAATTGTAATGTACCAACAGAAATATTATTG 
TAAGATGCCTTTCTTGTATAAGATATGCCAATATTTGCTTTAAATATCATATCACTGTATCT 
TCTCAGTCATTTCTGAATCTTTCCNCATTATATTATAAAATNTGGAAANGTCAGTTTATCTC 
CCCTCCTCNGTATATCTGATTTGTATANGTANGTTGATGNGCTTCTCTCTACAACATTTCTA 
GAAAATAGAAAAAAAAGCACAGAGA2VATGTTTAACTGTTTGACTCTTATGATACTTCTTGGA 
AACTATGACATCAAAGATAGACTTTTGCCTAAGTGGCTTAGCTGGGTCTTTCATAGCCAAAC 
TTGTATATTTAATTCTTTGTAATAAT.AA 



FIGURE 45 

MPLPWSLALPLLLSWVAGGFGNAASi^RHHGLLASARQPGVCHYGTKLACCYGWRRNSKGVCE 
ATCEPGCKFGECVGPNKCRCFPGYTGKTCSQDVNECGMKPRPCQHRCVNTHGSYKCFCLSGH 
MLMPDATCVNSRTCAMINCQYSCEDTEEGPQCLCPSSGLRLAPNGRDCLDIDECASGKVICP 
YNRRCVNTFGSYYCKCHIGFELQYISGRYDCIDINECTMDSHTCSHHANCFEJTQGSFKCKCK 
QGYKGNGLRCSAIPENSVKEVLRAPGTIKDRIKKLIJ^KNSMKXKAKIKNVTPEPTRTPTPK 
VNLQPFNYEEIVSRGGNSHGGKKGNEEK 

Signal peptide: 
amino acids 1-21 

EGF-like domain cysteine pattern signature. 

amino acids 80-91 

Calcium-binding EGF-like domains 

amino acids 103-124, 230-251 and 185-206 



FIGURE 46 

GGGAGCTGCTGCTGTGGCTGCTGGTGCTGTGCGCGCTGCTCCTGCTCTTGGTGCAGCTGCTG 
CGCTTCCTGAGGGCTGACGGCGACCITGACGCTACTATGGGCCGAGTGGCAGGGACGACGCCC 
AGAATGGGAGCTGACTGATATGGTGGTGTGGGTGACTGGAGCCTCGAGTGGAATTGGTGAGG 
AGCTGGCTTACCAGTTGTCTAAACTAGGAGTTTCTCTTGTGCTGTCAGCCAGAAGAGTGCAT 
GAGCTGGAAAGGGTGAAAAGAAGATGCCTAGAGAATGGCAATTTAAAAGAAAAAGATATACT 
TGTTTTGCCCCTTGACCTGACCGACACTGGTTCCCATGAAGCGGCTACCAAAGCTGTTCTCC 
AGGAGTTTGGTAGAATCGACATTCTGGTCAACAATGGTGGAATGTCCCAGCGTTCTCTGTGC 
ATGGATACCAGCTTGGATGTCTACAGAAAGCTAATAGAGCTTAACTACTTAGGGACGGTGTC 
CTTGACAAAATGTGTTCTGCCTCACATGATCGAGAGGAAGCAAGGAAAGATTGTTACTGTGA 
ATAGCATCCTGGGTATCATATCTGTACCTCTTTCCATTGGATACTGTGCTAGCAAGCATGCT 
CTCCGGGGTTTTTTTAATGGCCTTC(3AACAGAACTTGCCACATACCCAGGTATAATAGTTTC 
TAACATTTGCCCAGGACCTGTGCAATCAAATATTGTGGAGAATTCCCTAGCTGGAGAAGTCA 
CAAAGACTATAGGCAATAATGGAGACCAGTCCCACAAGATGACAACCAGTCGTTGTGTGCGG 
CTGATGTTAATCAGCATGGCCAATGATTTGAAAGAAGTTTGGATCTCAGAACAACCTTTCTT 
GTTAGTAACATATTTGTGGCAATACft.TGCCAACCTGGGCCTGGTGGATAACCAACAAGATGG 
GGAAGAAAAGGATTGAGAACTTTAAGAGTGGTGTGGATGCAGACTCTTCTTATTTTAAAATC 
TTTAAGACAAAACATGACTGAAAAGAGCACCTGTACTTTTCAAGCCACTGGAGGGAGAAATG 
GAAAACATGAAAACAGCAATCTTCTTATGCTTCTGAATAATCAAAGACTAATTTGTGATTTT 
ACTTTTTAATAGATATGACTTTGCTTCCAACATGGAATGAAATAAAAAATAAATAATAAAAG 
ATTGCCATGAATCTTGCAAAA 



nHHFiriTrrii 



FIGURE 47 

></usr/seqdb2 / ss t/DNA/Dnaseqs . min/ss . DNA3 6343 
xsubxinit 1 of 1, 289 aa, 1 stop 
xMW: 32268, pi: 9.21, NX(S/T): 0 

MVVWVTGASSGIGEELAYQLSKLGVSLVLSARRVHELERVKRRCLENGNLKEKDILVLPLDL 
TDTGSHEAATKAVLQEFGRIDILVNNGGMSQRSLCMDTSLDVYRKLIELNYLGTVSLTKCVL 
PHMI ERKQGKI VTVNS I LGI I S VPLS IGYCAS KHALRGFFNGLRTELAT YPGI I VSNI CPGP 
VQSNIVENSLAGEVTKTIGNNGDQSHKMTTSRCVRLMLISMANDLKEVWISEQPFLLVTYLW 
QYMPTWAPmi TNKMGKKRI ENFKSG''7DADSS YFKI FKTKHD 

Important Features: 
Signal Peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 136-157 

Tyrosine kinase phosphorylation site. 

106-113 and 107-114 

Homologous region to Short-chain alcohol dehydrogenase 

amino acids 80-90, 131-168, 1-13 and 176-185 



FIGURE 48 



GCGACGTGGGCACCGCCATCAGCTGTTCGCGCGTCTTCTCCTCCAGGTGGGGCAGGGGTTTC 
GGGCTGGTGGAGCATGTGCTGGGACAGGACAGCATCCTCAATCAATCCAACAGCATATTCGG 
TTGCATCTTCTACACACTACAGCTATTGTTAGGTTGCCTGCGGACACGCTGGGCCTCTGTCC 
TGATGCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTACCTGGCCTGGATCCTGTTC 
TTCGTGCTCTATGATTTCTGCATTGTTTGTATCACCACCTATGCTATCAACGTGAGCCTGAT 
GTGGCTCAGTTTCCGGAAGGTCCAAGAACCCCAGGGCAAGGCTAAGAGGCACTGAGCCCTCA 
ACCCAAGCCAGGCTGACCTCATCTGCTTTGCTTTGGTCTTCAAGCCGCTCAGCGTGCCTGTG 
GACAGCGTGGCCCCGGCCCCCCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCGAGTGGCCC 
TGGCAGGCCAGTGTGAGGAGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACAC 
CTGGGTCCTCACTGCTGCCCACTGCTTTGAAAAGGCAGCAGCAACAGAACTGAATTCCTGGT 
CAGTGGTCCTGGGTTCTCTGCAGCGTGAGGGACTCAGCCCTGGGGCCGAAGAGGTGGGGGTG 
GCTGCCCTGCAGTTGCCCAGGGCCTATAACCACTACAGCCAGGGCTCAGACCTGGCCCTGCT 
GCAGCTCGCCCACCCCACGACCCACACACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCC 
CCTTTGGAGCCTCCTGCTGGGCCACTGGCTGGGATCAGGACACCAGTGATGCTCCTGGGACC 
CTACGCAATCTGCGCCTGCGTCTCATCAGTCGCCCCACATGTAACTGTATCTACAACCAGCT 
GCACCAGCGACACCTGTCCAACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTG 
GGGTGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGA 
CACTGGGTTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGT 
GCTGCTGACCAACACAGCTGCTCACi^GTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTT 
TCCTGGCCCAGAGCCCAGAGACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGA 
TCCTTGAGGACAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCT 
GATGCACCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTG 
CTGCCCACTGCTTCATTGGGCGCCAC3GCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGA 
CCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGG 
CTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCC 
TCTGCCTGCCCTATCCTGACCACCAGCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGG 
GCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAG 
GGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGG 
TGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGGCCTGTCTGGGGCACCACTGGTG 
CATGAGGTGAGGGGCACATGGTTCCTGGCCGGGCTGCACAGCTTCGGAGATGCTTGCCAAGG 
CCCCGCCAGGCCGGCGGTCTTCACCGCGCTCCCTGCCTATGAGGACTGGGTCAGCAGTTTGG 
ACTGGCAGGTCTACTTCGCCGAGGA?iCCAGAGCCCGAGGCTGAGCCTGGAAGCTGCCTGGCC 
AACATAAGCCAACCAACCAGCTGCT<3ACAGGGGACCTGGCCATTCTCAGGACAAGAGAATGC 
AGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAGGCAC 
CAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCA 
CTCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAGC 
AGGCGTCTCAGCTTTCCTCCTCCTTTACTCTTTCAGATACAATCACGCCAGCCACGTTGTTT 
TGAAAATTTCTTTTTTTGGGGGGCAGCAGTTTTCCTTTTTTTAAACTTAAATAAATTGTTAC 
AAAATAAAA 



FIGURE 49 



>< /us r / seqdb2 / s s t /DNA/Dnas eqs . min/ s s . DNA4 0 5 7 1 

MLLSSLVSLAGSVYIAWILFFVLYDFCIVCITTYAIIWSLMWLSFRKVQEPQGKAKRHGNTV 
PGEWPWQASVRRQGAHICSGSLVAD":rWVLTAAHCFEKAAATELNSWSVVLGSLQREGLSPGA 
EEVGVAALQLPRAYNHYSQGSDLALLQLAHPTTHTPLCLPQPAHRFPFGASCWATGWDQDTS 
DAPGTLRNLRLRL I SRPTCNCI YNQILHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVLC 
LEPDGHWVQAGIISFASSCAQEDAP7LLTNTAAHSSWLQARVQGAAPLAQSPETPEMSDEDS 
CVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGRQAPEEWSV 
GLGTRPEEWGLKQLILHGAYTEIPEGGYDMALLLLAQPVTLGASLRPLCLPyPDHHLPDGERG 
WVLGRARPGAGI S S LQTVPVTLLGP RAG SRLHAAPGGDGS PI LPGMVCTSAVGELPSCEGLS 
GAPLVHEVRGTWFLAGLHSFGDACQGPARPAVFTALPAYEDWSSLDWQVYFAEEPEPEAEP 
GSCLANISQPTSC 

Important features: 
Signal peptide: 

amino acids 1-15 

Homologous region to Serine proteases, trypsin family 

amino acids 79-95, 343-359 and 237-247 

N-glycosylation sites. 

amino acids 37-40 and 564-567 

Kringle domains 

amino acids 79-96, 343-360 and 235-247 



FIGURE 50 



cgggccgcccccggcccccattcgggjccgggcctcgctgcggcggcgactgagccaggctgg 
gccgcgtccctgagtcccagagtcggcgcggcgcggcaggggcagccttccaccacggggag 
cccagctgtgagccgcctcacaggailigatgctgcgtcggcggggcagccctggcatgggtgt 
gcatgtgggtgcagccctgggagcac:tgtggttctgcctcacaggagccctggaggtccagg 
tccctgaagacccagtggtggcactggtgggcaccgatgccaccctgtgctgctccttctcc 
cctgagcctggcttcagcctggcacagctcaacctcatctggcagctgacagataccaaaca 
gctggtgcacagctttgctgagggccaggaccagggcagcgcctatgccaaccgcacggccc 
tcttcccggacctgctggcacaggggaacgcatccctgaggctgcagcgcgtgcgtgtggcg 

GACGAGGGCAGCTTCACCTGCTTCGTGAGCATCCGGGATTTCGGCAGCGCTGCCGTCAGCCT 

gcaggtggccgctccctactcgaagcccagcatgaccctggagcccaacaaggacctgcggc 
caggggacacggtgaccatcacgtgctccagctaccagggctaccctgaggctgaggtgttc 
tggcaggatgggcagggtgtgcccctgactggcaacgtgaccacgtcgcagatggccaacga 
gcagggcttgtttgatgtgcacagcgtcctgcgggtggtgctgggtgcgaatggcacctaca 
gctgcctggtgcgcaaccccgtgctcjcagcaggatgcgcacrgctctgtcaccatcacaggg 
cagcctatgacattccccccagaggccctgtgggtgaccgtggggctgtctgtctgtctcat 
tgcactgctggtggccctggctttcgftgtgctggagaaagatcaaacagagctgtgaggagg 
agaatgcaggagctgaggaccaggatggggagggagaaggctccaagacagccctgcagcct 
ctgaaacactctgacagcaaagaagatgatggacaagaaatagcctsiaccatgaggaccagg 
gagctgctacccctccctacagctcctaccctctggctgcaatggggctgcactgtgagccc 
tgcccccaacagatgcatcctgctctgacaggtgggctccttctccaaaggatgcgatacac 
agaccactgtgcagccttatttctccaatggacatgattcccaagtcatcctgctgcctttt 
ttcttatagacacaatgaacagaccacccacaaccttagttctctaagtcatcctgcctgct 
gccttatttcacagtacatacatttcttagggacacagtacactgaccacatcaccaccctc 
ttcttccagtgctgcgtggaccatctggctgccttttttctccaaaagatgcaatattcaga 
ctgactgaccccctgccttatttcaccaaagacacgatgcatagtcaccccggccttgtttc 
tccaatggccgtgatacactagtgatcatgttcagccctgcttccacctgcatagaatcttt 
tcttctcagacagggacagtgcggcc:tcaacatctcctggagtctagaagctgtttcctttc 
ccctccttcctccctgccccaagtgaagacagggcagggccaggaatgctttggggacaccg 

AGGGGACTGCCCCCCACCCCCACCAI'GGTGCTATTCTGGGGCTGGGGCAGTCTTTTCCTGGC 

ttgcctctggccagctcctggcctci'ggtagagtgagacttcagacgttctgatgccttccg 
gatgtcatctctccctgccccaggaatggaagatgtgaggacttctaatttaaatgtgggac 
tcggagggattttgtaaactgggggtatattttggggaaaataaatgtctttgtaaaaaaaa 

AAAAAAA/y^AAAAA 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss ,DNA413 86 
xsubunit 1 of 1, 316 aa, 1 stop, 1 xinknown 
xMW: -1, pi: 4.62, NX(S/T): 4 

MLRRRGSPGMGVHVGAALGALWFCLTGALEVQVPEDPWALVGTDATLCCSFSPEPGFSLAQ 
LNLIWQLTDTKQLVHSFAEGQDQGSAYANRTALFPDLLAQGNASLRLQRVRVADEGSFTCFV 
SIRDFGSAAVSLQVAAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVFWQDGQGVPL 
TGNVTTSQMANEQGLFDVHSVLRV^TiGANGTYSCLVRMPVLQQDAHXSVTITGQPMTFPPEA 
LWVTVGLSVCLIALLVALAFVCWRKIKQSCEEENAGAEDQDGEGEGSKTALQPLKHSDSKED 
DGQEIA 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 251-270 

N-glycosylation site. 

amino acids 91-94, 104-107, 189-192 and 215-218 

Homologous region to Immunoglobulins and MHC 

amino acids 217-234 



FIGURE 52 

ttcgtgacccttgagaaaagagttggtggtaaatgtgccacgtcttctaagaagggggagtc 
ctgaacttgtctgaagcccttgtcc:gtaagccttgaactacgttcttaaatctatgaagtcg 
agggacctttcgctgcttttgtagggacttctttccttgcttcagcaac atga ggcttttct 
tgtggaacgcggtcttgactctgttcgtcacttctttgattggggctttgatccctgaacca 
gaagtgaaaattgaagttctccagaagccattcatctgccatcgcaagaccaaaggagggga 
tttgatgttggtccactatgaaggctacttagaaaaggacggctccttatttcactccactc 
acaaacataacaatggtcagcccatttggtttaccctgggcatcctggaggctctcaaaggt 
tgggaccagggcttgaaaggaatgtgtgtaggagagaagagaaagctcatcattcctcctgc 
tctgggctatggaaaagaaggaaaj%.ggtaaaattcccccagaaagtacactgatatttaata 
ttgatctcctggagattcgaaatggfaccaagatcccatgaatcattccaagaaatggatctt 
aatgatgactggaaactctctaaagatgaggttaaagcatatttaaagaaggagtttgaaaa 

ACATGGTGCGGTGGTGAATG2\AAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAG 
AAGATGAAGACAAAGATGGGTTTAIATCTGCCAGAGAATTTACATATAAACACGATGAGTTA 
TAGAGATACATCTACCCTTTTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAA 
AGAACATTTTATTTTTATACAATGITCTTTCTTGCTTTGTTTTTTATTTTTATATATTTTTT 
CTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTTCTTTCTGATAAGTTATT 
GGGAAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCACAG 
ATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACC 
ACGACATGAGACCAGGTTATAGCACAflATTAGCACCCTATATTTCTGCTTCCCTCTATTTTC 
TCCAAGTTAGAGGTCAACATTTGAAZ^GCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCAT 
GTTATAATGAAATAGTTTATGTGTJiACTGGCTCTGAGTCTCTGCTTGAGQACCAGAGGAAAA 
TGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGATGTGCAATGCTGAAG 
TTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG 
GCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGGAGAAA 
CCCTATCTCTACTAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCT 
ACCCAGGAAGGCTGAGGCGGCAGAATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAG 
ATCACCTNCAGCCTGGACACTCTGTCTCGAAAAAAGAAAAGAACACGGTTAATACCATATNA 
ATATGTATGCATTGAGACATGCTACCTAGGACTTAAGCTGATGAAGCTTGGCTCCTAGTGAT 
TGGTGGCCTATTATGATAAATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATG 
TATCAATATGTTATAGATGAGGTAG.AAAGTTATATTTATATTCAATATTTACTTCTTAAGGC 
TAGCGGAATATCCTTCCTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTACAATAACA 
TTGTATCATAAGATAAAGTAGTAAACCAGTCTACATTTTCCCATTTCTGTCTCATCAAAAAC 
TGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGGGCCAAGGAGGG 
TGGATCACTTGAGATCAGGAGTTCAif^GACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTA 
CTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAG 
GCTGAGACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCC 
ACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAGAAGCAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 



FIGURE 53 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44194 
xsubunit 1 of 1, 211 aa, 1 stop 
xMW: 24172, pi: 5.99, NX(S/T): 1 

MRLFLWNAVLTLFVTSLIGALIPEPEVKIEVLQKPFICHRKTKGGDLMLVHYEGYLEKDGSL 
FHSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLIIPPALGYGKEGKGKIPPEST 
LI FNI DLLE IRNGPRSHE S FQEiynDniSroDWKLSKDEVKAYLKKE FEKHGAVWESHH^ 
IFDKEDEDKDGFISAREFTYKHDEL 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorsrlation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulum targeting sequence. 

amino acids 208-211 

FKBP-type pep t idyl -prolyl cis-trans isomerase 

amino acids 78-114 and 118-131 

EF-hand calcium-binding domain. 

amino acids 191-203, 184-203 and 140-159 

S-lOO/ICaBP type calcixm binding domain 

amino acids 183-203 



FIGURE 54 

AATAAAGCTTCCTTAATGTTGTATATGTCTTTGAAGTACATCCGTGCATTTTTTTTTAGCAT 
CCAACCATTCCTCCCTTGTAGTTCTCGCCCCCTCAAATCACCCTCTCCCGTAGCCCACCCGA 
CTAACATCTCAGTCTCTGAAAATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCT 
CACGGGGCTCAGTCTCTTTTTCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCACAGTAC 
CTGCCACCCTCAACGTCCTCAATGGCTCTGACGCCCGCCTGCCCTGCACCTTCAACTCCTGC 
TACACAGTGAACCACAAACAGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTC 
TGAGGAGATGTTCCTCCAGTTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAG 
ACCGCGTGGAGTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTG 
CAGCCGGAGGATGAGGGGATTTACA?ICTGCTACATCATGAACCCCCCTGACCGCCACCGTGG 
CCATGGCAAGATCCATCTGCAGGTCCTCATGGAAGAGCCCCCTGAGCGGGACTCCACGGTGG 
CCGTGATTGTGGGTGCCTCCGTCGGGGGCTTCCTGGCTGTGGTCATCTTGGTGCTGATGGTG 
GTCAAGTGTGTGAGGAGAAAAAAAGl^GCAGAAGCTGAGCACAGATGACCTGAAGACCGAGGA 
GGAGGGCAAGACGGACGGTGAAGGCiy^CCCGGATGATGGCGCCAAGTAGTGGGTGGCCGGCC 
CTGCAGCCTCCCGTGTCCCGTCTCCTCCCCTCTCCGCCCTGTACAGTGACCCTGCCTGCTCG 
CTCTTGGTGTGCTTCCCGTGACCTAGGACCCCAGGGCCCACCTGGGGCCTCCTGAACCCCCG 
ACTTCGTATCTCCCACCCTGCACCA^IGAGTGACCCACTCTCTTCCATCCGAGAAACCTGCCA 
TGCTCTGGGACGTGTGGGCCCTGGGGAGAGGAGAGAAAGGGCTCCCACCTGCCAGTCCCTGG 
GGGGAGGCAGGAGGCACATGTGAGGGTCCCCAGAGAGAAGGGAGTGGGTGGGCAGGGGTAGA 
GGAGGGGCCGCTGTCACCTGCCCAGTGCTTGCCTGGCAGTGGCTTCAGAGAGGACCTGGTGG 
GGAGGGAGGGCTTTCCTGTGCTGACAGCGCTCCCTCAGGAGGGCCTTGGCCTGGCACGGCTG 
TGCTCCTCCCCTGCTCCCAGCCCAGAGCAGCCATCAGGCTGGAGGTGACGATGAGTTCCTGA 
AACTTGGAGGGGCATGTTAAAG6GATGACTGTGCATTCCAGGGCACTGACGGAAAGCCAGGG 
CTGCAGGCAAAGCTGGACATGTGCCCTGGCCCAGGAGGCCATGTTGGGCCCTCGTTTCCATT 
GCTAGTGGCCTCCTTGGGGCTCCTGTTGGCTCCTAATCCCTTAGGACTGTGGATGAGGCCAG 
ACTGGAAGAGCAGCTCCAGGTAGGGI3GCCATGTTTCCCAGCGGGGACCCACCAACAGAGGCC 
AGTTTCAAAGTCAGCTGAGGGGCTGAGGGGTGGGGCTCCATGGTGAATGCAGGTTGCTGCAG 
GCTCTGCCTTCTCCATGGGGTAACCACCCTCGCCTGGGCAGGGGCAGCCAAGGCTGGGAAAT 
GAGGAGGCCATGCACAGGGTGGGGCAGCTTTCTTTGGGGCTTCAGTGAGAACTCTCCCAGTT 
GCCCTTGGTGGGGTTTCCACCTGGCTTTTGGCTACAGAGAGGGAAGGGAAAGCCTGAGGCCG 
GCATAAGGGGAGGCCTTGGAACCTGAGCTGCCAATGCCAGCCCTGTCCCATCTGCGGCCACG 
CTACTCGCTCCTCTCCCAACAACTCCCTTCGTGGGGACAAAAGTGACAATTGTAGGCCAGGC 
ACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATTACCTCCAT 
CTGTTTAGTAGAAATGGGCAAAACCCCATCTCTACTAAAAATACAAGAATTAGCTGGGCGTG 
GTGGCGTGTGCCTGTAATCCCA.GCTATTTGGGAGGCTGAGGCAGGAGAATCGCTTGAGCCCG 
GGAAGCAGAGGTTGCAGTGAACTGAGATAGTGATAGTGCCACTGCAATTCAGCCTGGGTGAC 
ATAGAGAGACTCCATCTCAAAAAAAS. 



FTGURE 55 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45415 
<siibunit 1 of 1, 215 aa, 1 stop 
<MW: 24326, pi: 6.32, NX(S/T): 4 

MHRDAWLPRPAFSLTGLSLFFSLVPPGRSMEVTVPATIJSrVliNGSDARLPCTFNSCYI^^ 
FSIJWTYQECimCSEEMFLQFRMKIINLKLERFQDRVEFSGNPSKYDVSVMLRNVQPEDEGI 
YNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSTVAVIVGASVGGFLAWILVLMWKCVRRK 
KEQKLSTDDLKTEEEGKTDGEGNPDDGAK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 161-179 

Immunoglobulin- like fold: 

amino acids 83-127 

N-glycosylation sites. 

amino acids 42-45, 66-69 and 74-77 



FIGURE 56 



GTTGTATATGTCCTGAAGTACATCCGTGCATTTTTTTTAGCATCCAACCATCCTCCCTTGTA 
GTTCTCGCCCCCTCAAATCACCTTCTCCCTTAGCCCACCCNACTAACATCTCAGTCTCTGAA 
AATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCTCACGGGGCTCAGTCTCTTTT 
TCTCTTTGGTGCCACCAGGACGGAGGATGGAGGTCCACAGTACCTGNCCACCCTCAACGTCC 
TCAATGGCTCTGACGCCCGCCTGCCCTGCCCTTCAACTCCTGCTACACAGTGAACCACAAAC 
AGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTCTGAGGAGATGTTCCTCCAG 
TTCCGCATGAAGATCATTAACCTGAi\GCTGGAGCGGTTTCAAGACCGCGTGGAGTTCTCAGG 
GAACCCCAGCAAGTACGATGTGTCGGfTGATGCTGAGAAACGTGCAGCCGGAGGATGAGGGGA 
TTTACAACTGCTACATCATGAACCCCCC 



■fl" 



E'IGURE 57 



TCACGGGGCTCATCTCTTTTTCTCTTTGGTGCCCACCAGGACGGAGCATGGAGGTNCACATA 
CCTGCCACCCTCAACGTCCTCAATGGCTTTGACGCCCGCCTGCCCTGCACCTTCAACTCCNG 
CTACACAGTGAACCACAAACAGTTCTCCCTGAACTGGATTTACCAGGAGTGCAACAACTGGC 
TCTGAGGAGATGTTCCTCCAGTTCCCGCATGGAAGATCATTTAACCTGAAAGCTGGAAGCGG 
TTTTCAAGAACCGCGTGGAAGTTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGC 
TGAGAAACGTGCAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCC 



FIGURE 58 



TGCGGCGACCGTCGTACa^CC aTGG GCCTCCa.CCTCCGCCCCTACCGTGTGGGGCTGCTCCCGGATGGCCTCCTGT 

TCCTCTTGCTGCTGCTAATGCTGCTCGCGG5.CCCAGCGCTCCCGGCCGGACGTCACCCCCCAGTGGTGCT6GTCC 

CTGGTGATTTGGGTAACCAACTGGAAGCCA?^GCTGGACaAGCCGAa^GTGGTGCACTACCTCTGCTCa^GZ^ 

CCGAAAGCTACTTCa.CaATCTGGCTGAACCTGGAACTGCTGCTGCCTGTCATCATTGACTGCTGGATTGAC^ 

TCAGGCTGGTTTACAACAAAACATCCAGGGCCACCCAGTTTCCTGATGGTGTGGATGTACGTGTCCCTGGCTTTG 

GGAAGACCTTCTCACTGGAGTTCCTGGACCCCAGCAAAAGaVGCGTGGGTTCCTATTTCCACa.CCaT 

GCCTTGTGGGCTGGGGCTACacakCGGGGTGft.GGATGTCCGAGGGGCTCCCTATGACTGGCGCCGAGCCCCAAATG 

AAAACGGGCCCTACTTCCTGGCCCTCCGCGSiGATGATCGAGGAGATGTACCAGCTGTATGGGGGCCCCGTGGTGC 

TGGTTGCCCAOIGTATGGGCAACATGTACACGCTCTACTTTCTGCAGCGGG^GCCGCAGGCCTGGAAGC^ 

ACAACAACCGGATCCCAGTCaTCGGGCCCCTGAAGATCCGGGAGCAGaVGCGGTCAGCTGTCTCa^ 

TGCTGCCCTACaACTACACATGGTC:ACCTGft.GAAGGTGTTCGTGCAGACS^CCCACAATCAACTAC:AC^ 

ACTACCGOiAGTTCTTCCAGGACATCGGCTTTGAAGATGGCTGGCTCATGCGGCAGGACACAGAAGGGCTGGTGG 

AAGCCACGATGCCACCTGGCGTGCAGCTGC^iCTGCCTCTATGGTACTGGCGTCCCCACACCAGACTCCTTCTACT 

ATGAGAGCTTCCCTGACCGTGACCCTAAAATCTGCTTTGGTGACGGCGATGGTACTGTGAACTTGAAGAGTGCCC 

TGCAGTGCaVGGCCTGGO^GAGCCGCCAGGfiGCACCa^GTGTTGCTGCAGGAGCTGCCAGGCA^ 

AGATGCTGGCCAACGCCACCa.CCCTGGCCTATerGAAACGTGTGCTCCTTGGGCC CTGA CTCCTGTGCCACAGGA 

CTCCTGTGGCTCGGCCGTGGACCTGCTGTTCiGCCTCTGGGGCTGTCATGGCCCACGCGTTTTGCAAAGTTTGTGA 

CTCACCATTCmGGCCCCGAGTCTTGGACTG(TGAAGCATCTGCCa.TGGGGAAGTGCTGTTTGTTATCCTTTCTCT 

GTGGGAGTGAAQAAGGAAGaAATGAGAGTCTAGACTCAAGGGACACTGGATGGCAAQAATGCTGCTGATGGTGGA 

ACTGCTGTGACCTTAGGACTGGCTCCACAGGGTGGACTGGCTGGGCCCTGGTCCCAGTCCCTGCCTGGGGCCATG 

TGTCCCCCTATTCCTGTGGGCTTTTCATACTTGCCTACTGGGCCCTGGCCCCGCAGCCTTCCTATGAGGGATGTT 

ACTGGGCTGTGGTCCTGTACCCAGAGGTCCCAGGGATCGGCTCCTGGCCCCTCGGGTGACCCTTCCCACACACCA 

GCCACAGATAGGCCTGCCACTGGTCATGGGTAGCTAGAGCTGCTGGCTTCCCTGTGGCTTAGCTGGTGGCCAGCC 

TGACTGGCTTCCTGGGCGAGCCTAGTAGCTCCTGCAGGCAGGGGCAGTTTGTTGCGTTCTTCGTGGTTCCCAGGC 

CCTGGGACATCTCACTCCACTCCTACCTCCCTTAC<:a.CC7lGGAGCATTa^GCTCTGGATTGGGCaLGCAG^ 

CCCCCAGTCCCGCAGGCTGTGTTCCAGGGGCCCTGATTTCCTCGGATGTGCTATTGGCCCCAGGACTGAAGCTGC 

CTCCCTTCACCCTGGGACTGTGGTTCCaAGGATGAGAGCAGGGGTTGGAGCCATGGCCTTCTGGGAACCTATGGA 

GZ^GGGAATCCAAGGAAGCAGCO^GGCTGCTCGCAGCTTCCCTGAGCTGCACCTCTTGCTAACCCa^CCATC^ 

CAOTGCCACCCTGCCCTAGGGTCTCACTAGTACCAAGTGGGTCAGCACAGGGCTGAGQATGGG^ 

CCTGGCCAGCACCCAGCTTAGTGCTGGGACTAGCCCAGAAACTTGAATGGGACCCTGAGAGAGCCAGGGGTCCCC 

TGAGGCCCCCCTAGGGGCTTTCTGTCTGCCCCAGGGTGCTCCATGGATCTCCCTGTGGCAGCAGGCATGGA^^ 

CaGGGCTGCCTTCATGGCaGTAGGCTCTAAGTGGGTGACTGGCCACyiGGCCGAGA2^GGGTACAGCCTCTAGGT 

GGGGTTCCCAAAGACGCCTTC2VGGCTGGACTGAGCTGCTCTCCCACAGGGTTTCTGTGCAGCTGGATTTTCTCTG 

TTGCATACSVTGCCTGGCa.TCTGTCTCCCCTTGTTCCTGAGTGGCCCCa^a^TGGGGCTCTGAGCAGGCTGTATCTG 

GATTCTGGCAATAAAAGTACTCTGGATGCTGTAAAA2kAAAAAAAAAAAAAAAAAA 



FIGURE 5 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44189 
xsubunit 1 of 1, 412 aa, 1 stop 
xMW: 46658, pi: 6.65, NX(S/T): 4 

MGLHLRPYRVGLLPDGLLFLLLLLMIjLADPALPAGRHPPVVLVPGDLGNQLEAKLDKPTVVH 
YLCSKKTESYFTIWLNLELLLPVIIDCWIDNIRLVYNKTSRATQFPDGVDVRVPGFGKTFSL 
EFLDPSKSSVGSYFHTMVESLVGWGYTRGEDVRGAPYDWRRAPNENGPYFLALREMIEEMYQ 
LYGGPWLVAHSMGNMYTLYFLQRQPQAWKDKYIRAFVSLGAPWGGVAKTLRVLASGDNWRI 
PVIGPLKIREQQRSAVSTSWLLPYNYTWSPEKVFVQTPTINYTLRDYRKFFQDIGFEDGWLM 
RQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFGDGDGTVNLKSALQCQ 
AWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRVLLGP 

Important features: 
Signal peptide: 

amino acids 1-28 

Potential lipid substrate binding site: 

amino acids 147-164 

N-glycosylation sites. 

amino acids 99-102, 273-276, 289-292 and 398-401 

Lipases, serine proteins 

amino acids 189-201 

Beta-transducin family Trp-Asp repeat 

amino acids 353-365 



FIGURE 60 



CGGACGCGTGGGCGGACGCGTGGGGCGGCGGCAGCGGCGGCGACGGCGACATGGAGAGCGGG 

gcctacggcgcggccaaggcgggcggctccttcgacctgcggcgcttcctgacgcagccgca 
ggtggtggcgcgcgccgtgtgcttg(3tcttcgccttgatcgtgttctcctgcatctatggtg 
agggctacagcaatgcccacgagtctaagcagatgtactgcgtgttcaaccgcaacgaggat 
gcctgccgctatggcagtgccatcggggtgctggccttcctggcctcggccttcttcttggt 
ggtcgacgcgtatttcccccagatcagcaacgccactgaccgcaagtacctggtcattggtg 
acctgctcttctcagctctctggaccttcctgtggtttgttggtttctgcttcctcaccaac 
cagtgggcagtcaccaacccgaaggacgtgctggtgggggccgactctgtgagggcagccat 
caccttcagcttcttttccatcttctcctggggtgtgctggcctccctggcctaccagcgct 
acaaggctggcgtggacgacttcatccagaattacgttgaccccactccggaccccaacact 
gcctacgcctcctacccaggtgcatc:tgtggacaactaccaacagccacccttcacccagaa 
cgcggagaccaccgagggctaccagccgccccctgtgtactgagtggcggttagcgtgggaa 
gggggacagagagggccctcccctctgccctggactttcccatcagcctcctggaactgcca 
gcccctctctttcacctgttccatcgtgtgcagctgacacacagctaaggagcctcatagcc 
tggcgggggctggcagagccacaccccaagtgcctgtgcccagagggcttcagtcagccgct 

CACTCCTCCAGGGCACTTTTAGGAAJIiGGGTTTTTAGCTAGTGTTTTTCCTCGCTTTTAATGA 
CCTCAGCCCCGCCTGCAGTGGCTAGi^AGCCAGCAGGTGCCCATGTGCTACTGACAAGTGCCT 
CAGCTTCCCCCCGGCCCGGGTCAGGC:CGTGGGAGCCGCTATTATCTGCGTTCTCTGCCAAAG 
ACTCGTGGGGGCCATCACACCTGCCC^TGTGCAGCGGAGCCGGACCAGGCTCTTGTGTCCTCA 
CTCAGGTTTGCTTCCCCTGTGCCCACTGCTGTATGATCTGGGGGCCACCACCCTGTGCCGGT 
GGCCTCTGGGCTGCCTCCCGTGGTGTGAGGGCGGGGCTGGTGCTCATGGCACTTCCTCCTTG 
CTCCCACCCCTGGCAGCAGGGAAGGGCTTTGCCTGACAACACCCAGCTTTATGTAAATATTC 
TGCAGTTGTTACTTAGGAAGCCTGGG5GAGGGCAGGGGTGCCCCATGGCTCCCAGACTCTGTC 
TGTGCCGAGTGTATTATAAAATCGTGGGGGAGATGCCCGGCCTGGGATGCTGTTTGGAGACG 
GAATAAATGTTTTCTCATTCAAAG 



FIGURE 61 

</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48304 
<siibunit 1 of 1, 224 aa, 1 stop 
<MW: 24810, pi: 4.75, NX(S/T): 1 

MESGAYGAAKAGGSFDLRRPLTQPQWARAVCLVFALIVFSCIYGEGYSNAHESKQMYCVFN 
RNEDACRYGSAIGVLAFLASAFFLVVDAYFPQISNATDRKYLVIGDLLFSALWTFLWFVGFC 
PLTNQWAVTNPKDVLVGADSVRAAITFSFFSIFSWGVLASLAYQRYKAGVDDFIQNYVDPTP 

dpntayasypgasvdnyqqppftqn:\ettegyqpppvy 

Important features: 

Type II Transmembrane domain: 

amino acids 1-45 

Other transmembrane domains: 

amino acids 74-90, 108-12(5 and 145-161 

N-glycosylation site. 

amino acids 97-100 



FIGURE 62 

GAGCGj^CCTACCCTGCTCCGAGGCCAGGCCTGCaGGGCCTCa.TCGGCCAGAGGGTGATCAGTGAQCAGaAG GA^ 

CCCGTGGCCGAGGCCCCCCAGGTGGCTGGCGGGCAGGGGGACGGAGGTGATGGCGAGGAAGCGGAGCCAGftGGGG 

ATGTTCAAGGCCTGTGAGGACTCCAAGAGAAAAGCCCGGGGCTACCTCCGCCTGGTGCCCCTGTTTGTGCTGCTG 

GCCCTGCTCGTGCTGGCTTCGGCGGGGGTGCTACTCTGGTATTTCCTAGGGTACAAGGCGGAGGTGATGGTCAGC 

CAGGTGTACTCAGGCAGTCTGCGTGTACTCAATCGCCACTTCTCCCAGGATCTTACCCGCCGGGAATCTAGTGCC 

TTCCGCaGTGAAACCGCC:aAAGCCCaGAAGATGCTCAAGGAGCTCa.TC:ACCAGCyiCCCGCCTGGGAA 

AACTCCAGCTCCGTCTATTCCTTTGGGGAGGGACCCCTCACCTGCTTCTTCTGGTTCATTCTCCAAATCCCCGAG 

CACCGCCGGCTGATGCTGAGCCCCGAGGTGGTGCAGGCACTGCTGGTGGAGGAGCTGCTGTCCACAGTCAACAGC 

TCGGCTGCCGTCCCCTACAGGGCCGAGTACGAAGTGGACCCCGAGGGCCTAGTGATCCTGGAAGCCAGTGTGAAA 

GACATAGCTGCATTGAATTCCACGCTGGGTTGTTACCGCTACAGCTACGTGGGCCAGGGCCAGGTCCTCCGGCTG 

AAGGGGCCTGACCa.CCTGGCCTCCAGCTGCCTGTGGCACCrGCTlGGGCCCCAAGQACCTCyiTGCTCAAACTCCGG 

CTGGAGTGGACGCTGGCAGAGTGCCGGGACCGACTGGCCATGTATGACGTGGGCGGGCCCCTGGAGAAGAGGCTC 

ATCACCTCGGTGTACGGCTGCAGCCGCCAGGAGCCCGTGGTGGAGGTTCTGGCGTCGGGGGCCATCATGGCGGTC 

GTCTGGAAGAAGGGCCTGCACAGCTACTACGACCCCTTCGTGCTCTCCGTGCAGCCGGTGGTCTTCCAGGCCTGT 

GAAGTGAACCTGACGCTGGACAACAGGCTCSACTCCCAGGGC6TCCTCAGCACCCCGTACTTCCCCAGCTACTAC 

TCGCCCCAAACCCACTGCTCCTGGCACCTC^CGGTGCCCTCTCTGGACTACGGCTTGGCCCTCTGGTTTGATGCC 

TATGCACTGAGGAGGCAGAAGTATGATTTGCCGTGCaLCCCAGGGCCmGTGGACGATCCAGAAa^GGAGGCTG 

GGCTTGCGCATCCTGCAGCCCTACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACC 

TCCCAGATCTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTAC7VACCAGTCGGACCCCTGCCCTGGA 

GAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGQACTGCCCCAACGGCCTGGAT 

GAGAGAAACTGCGTTTGCAGAGCCyiCATTCiCAGTGCAAAGAGGACAGCACATGCSlTCTCaCTGCCCAAGGT^ 

GATGGGCAGCCTGATTGTCTCaACGGCAGOSATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACC 

TTCCAGTGTGAGGACCGGAGCTGCGTGAAGiiiAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGC 

TCGGATGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGTCCTCCGAG 

GGTGAGTGGCCATGGGAGGCCAGCCTCCAGGTTCGGGGTCGAOICATCTGTGGGGGGGCCCTCATCGCTGACCGC 

TGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCATGGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGC 

AAGGTGTGGCAGAACTCGCGCTGGCCTGGA<3AGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCAC 

GAAGAGGACAGCCATGACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGC 

CCCGTCTGCCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCTGGGGCGCCTTG 

CGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAAAGTGGATGTGCaGTTGATCCCACAGGACCTGTGCAGCGAG 

GCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGCCGGCTACCGCAAGGGCAAQAAGGATGCCTGTCAGGGT 

GACTCAGGTGGTCCGCTGGTGTGCftAGGCACTCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTG 

GGCTGTGGCCGGCCTAACTACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGTG 

ACCTGAGGAACTGCCCCCCTGCAAAGCyVGGiSCCCACCTCCTGGACTCAGAGAGCCCAGGGCAACTGCCAAGC^ 

GGGACAAGTATTCTGGCGGGGGGTGGGGGAI3AGAGCAGGCCCTGTGGTGGCAGGAGGTGGCATCTTGTCTCGTCC 

CTGATGTCTGCTCCAGTGATGGCAGGAGGATGGAGAAGTGCCAGCAGCTGGGGGTCAAGACGTCCCCTGAGGACC 

CAGGCCCACACCCAGCCCTTCTGCCTCCCAi\TTCTCTCTCCTCCGTCCCCTTCCTCCACTGCTGCCTAATGCAAG 

GCAGTGGCTCAGOIGCS^GAATGCTGGTTCTACATCCCGAGGAGTGTCTGAGGTGCGCCCCACTCTGTACAGAGG 

CTGTTTGGGCAGCCTTGCCTCCAGAGAGCA(3ATTCCAGCTTCGQAAGCCCCTGGTCTAACTTGGGATCTGGGAAT 

GGAAGGTGCTCCCATCGGAGGGGACCCTCAGAGCCCTGGAGACTGCCAGGTGGGCCTGCTGCCACTGTAAGCCAA 

AAGGTGGGGAAGTCCTGACTCCAGGGTCCTTGCCCCACCCCTGCCTGCCACCTGGGCCCTCACAGCCCAGACCCT 

CaCTGGGAGGTGAGCTCmGCTGCCCTTTGGiy^TAAAGCTGCCTGATCAAAAAAAAAAAAAAAAAAAAA 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49152 
xsubunit 1 of 1, 802 aa, 1 stop 
xMW: 88846, pi: 6.41, NX(S/T): 7 

MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLFVLLALLVLASAGVL 

LWYFLGYKAEVMVSQVYSGSLRVLNRHFSQDLTRRESSAPRSETAKAQKMLKELITSTRI.GT 

YYNSSSVYSFGEGPLTCFFWFILQIPEHRRLMLSPEWQALLVEELLSTVNSSAAVPYRAEY 

EVDPEGLVILEASVKDIAALNSTLGJCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPiODLML 

KLRLEWTLAECRDRIiAiyrroVAGPLEKRLITSWGCSRQEPVVEVLASGAIMA'^^ 

YDPFVLSVQPWFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPSLDYGLAL 

WFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWATAGITINFTSQISLTGP 

GVRVHYGLYNQSDPCPGEFLCSVNGLCVPACDGVKDCPNGLDERNCVCRATFQCKEDSTCIS 

LPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCD 

CGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVL 

WTVFLGKVWQNSRWPGEVSFKVSRLiLLHPYHEEDSHDYDVALLQLDHPWRSAAVRPVCLPA 

RSHFFEPGLHCWITGWGALREGGPISNALQKVDVQLIPQDLCSEAYRYQVTPRMLCAGYRKG 

KKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQVVT 

Important features: 

Type II transmesncibrane domain: 

amino acids 46-67 

Serine proteases, trypsin family, histidine active site, 
amino acids 604-609 
N-glycosylation sites. 

amino acids 127-130, 175-178, 207-210, 329-332, 424-427, 444-447 
and 509-512 
Kringle domains. 

amino acids 746-758 and 592-609 

Homologous region to Kallikrein Light Chain: 

amino acids 568-779 

Homologous region to Low-density lipoprotein receptor: 

amino acids 451-567 



FIGURE 64 

GCACCCAGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCC 
TACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGAT 
CTCCCTCACCGGGCCCGGTGTGCGGCSTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCC 
CTGGAGAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGAC 
TGCCCCAACGGCCTGGATGAGAGAAi^CTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGA 
CAGCACATGCATCTCACTGCCCAAG(3TCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCG 
ATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGG 
AGCTGCGTGAAGAAGCCCAACCCGC71GTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGA 
TGAGGAGCACTGTGACTGTGGCCTC'CAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGT 
CCTCCGAGGGTGAGTGGCCATGGCAC3GCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGG 
GGGGCCCTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCAT 
GGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGCAAGGTGTGGCAGAACTCGCGCTGGCCTG 
GAGAGGTGTCCTTCAAGGTGAGCCGC:CTGCTCCTGCACCCGTACCACGAAGAGGACAGCCAT 
GACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCC 
CGTCTGCCTGCCCGCGCGCTCCCACITCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCT 
GGGGCGCCTTGCGCGAGGGCGGCCCCIATCAGCAACGCTCTGCAGAAAGTGGATGTGCAGTTG 
ATCCCACAGGACCTGTGCAGCGAGGGCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGC 
CGGCTACCGCAAGGGCAAGAAGGATGICCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCA 
AGGCACTCAGTGGCCGCTGGTTCCTC^GCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGG 
CCTAACTACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGT 
GACCTGAGGAACTGCCCCCCTGCAAJIGCAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGC 
AACTGCCAAGCAGGGGGACAAGTAT 



GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGG 
CTCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCG 
AGAGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAA 
ACTTCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAG GATGA GCAACTCCQTTCCTCTGCTCTG 
TTTCTGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGAC 
GGCTGGAAGATAAGCTCCACAAACC'CIAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGG 
TTTAACCTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAG 
CCAGCCCTTAGAAGACTGCAGTTTC.?iACATGACAGCTAAAACCTTTTTCATCATTCACGGAT 
GGACGATGAGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGA 
GAGAAAGACGCCAATGTAGTTGTGGTTGACTGGCTCCCCCTGGCCCACCAGCTTTACACGGA 
TGCGGTCAATAATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGG 
AGAAGGACGATTTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTG 
GCCGGGTATGCAGGCAACTTCGTGAi^AGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGC 
CGGGCCCATGTTTGAAGGGGCCGACATCCACAAGAGGCTCTCTCCGGACGATGCAGATTTTG 
TGGATGTCCTCCACACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTG 
GGCCACATTGACATCTACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGT 
CTTGGGATCAATTGCATATGGAACAZ^TCACAGAGGTGGTAAAATGTGAGCATGAGCGAGCCG 
TCCACCTCTTTGTTGACTCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACT 
GACTCCAATCGCTTCAAT^AAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCAT 
TGGCTACAATGCCAAGAAAATGAGGi^CAAGAGGAACAGCAAAATGTACCTAAAAACCCGGG 
CAGGCATGCCTTTCAGAGGTAACCTTCAGTCCCTGGAGTGTCCCTGAGGAAGGCCCTTAATA 
CCTCCTTCTTAATACCATGCTGCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCA 
CAATCCAATCAAATCGTTGCAAATCAGATTACACTGTGCATGTCCTAGGAAAGGGAATCTTT 
ACAAAATAAACAGTGTGGACCCCTAJlTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 



B'IGURE 66 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49646 
xsubunit 1 of 1, 354 aa, 1 stop 
xMW: 39362, pi: 8.35, NX(S/T) : 2 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKLHKPKATQTEVKPSVRFNLRTSKDPEHE 
GCYLSVGHSQPLEDCSFmTAKTFFIIHGWTMSGIFENWLHKLVSALHTREKDANVVVVDWL 
PLAHQLYTDAVNNTRWGHSIARMLDWLQEKDDFSLGNVHLIGYSLGAHVAGYAGNFVKGTV 
GRITGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDF 
QPGCGLNDVLGS lAYGTITEWKCEHERAVHLFVDSLVNQDKPSFAFQCTDSNRFKKGI CLS 
CRKNROTSIGYNAIOayiRNKRMSKMYLKTRAGMPFRGNLQSLECP 

Important features: 
Signal peptide: 

amino acids 1-16 

Lipases, serine active site. 

amino acids 163-172 

N-glycosylation sites. 

amino acids 80-83 and 136-139 



CGGACGCGTGGGCGGACGCGTGGGCCTGGGCAAGGGCCGGGGCGCCGGGCCGAGCCACCTCTTCCCCTCCCCCGC 

TTCCOTGTCGCGCTCCGCTGGCTGGACGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGCTGACAGT 

CGGCaAAGTTTGGCCCGAAQAGGAAGTGGTCTC?^CCCCGGCAGGTGGCGACCAGGCC7^GACCAGGGGC^^ 

CTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAG 

AGAAGAGTGCGGCGGCGGACGGAGAAAACA^CTCCyU^GTTGGCGaAAGGCACCGCCCCTACTCCCGGGCTGCCG 

CCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGGGAGGCGG 

CACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCACAGAAGCTCCGGGACCCTTCCG 

GCACCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTCCTCCTCCTCCTCCTTGGAGGCGCTCTGGCCCATCCAG 

ACCGQATTATTTTTCCAAATCATGCTTGTGAGGACCCCCCAGCAGTGCTCTTAGAAGTGCAGGGCACCTTACAGA 

GGCCCCTGGTCCGGGACAGCCGCACCTCCCCTGCCAACTGCACCTGGCTCATCCTGGGCAGCAAGGAACa^^^ 

TO^CCATCAGGTTCCAGAAGCTACACCTGGCCTGTGGCTCAGAGCGCTTAACCCTACGCTCCCCTCTCCAGCCAC 

TGATCTCCCTGTGTGAGGCACCTCCCy^GCCCTCTGCAGCTGCCCGGGGGCAACGTCACCATCACTTACAGCTATG 

CTGGGGCCAGAGCykCCCATGGGCCAGGGCrrCCTGCTCTCCTACAGCCAAGATTGGCTGATGTGCCTGCAGGAAG 

AGTTTCAGTGCCTGAACCACCGCTGTGTATCTGCTGTCCAGCGCTGTGATGGGGTTGATGCCTGTGGCGATGGCT 

CTQATGAAGCAGGTTGCAGCTCAGACCCCTTCCCTGGCCTGACCCCAAGACCCGTCCCCTCCCTGCCTTGCAATG 

TCaCCTTGGAGGACTTCTATGGGGTCTTCTCCTCTCCTGGATATACACaCCTAGCCTCAGTCTCCCACCCCCAGT 

CCTGCCATTGGCTGCTGGACCCCCATGATGGCCGGCGGCTGGCCGTGCGCTTCACAGCCCTGGACTTGGGCTTTG 

GAGATGCAGTGCATGTGTATGACGGCCCT03GCCCCCTGAGAGCTCCCGACTACTGCGTAGTCTCACCCACTTCA 

GCaATGGCaAGGCTGTCACTGTGGAGACACTGTCTGGCCAGGCTGTTGTGTCCTACCACACAGTTGCTTGGAGCA 

ATGGTCGTGGCTTCAATGCCACCTACGATGTGCGGGGCTATTGCTTGCCTTGGGACAGACCCTGTGGCTTAGGCT 

CTGGCCTGGGAGCTGGCQAAGGCCTAGGTGi^GCGCTGCTACAGTGAGGCACAGCGCTGTGACGGCTCATGGGACT 

GTGCTGACGGOlCAGATGAGGAGQAOTGCaiAGGCTGCCCACOTGGAaiCTTCCCCTGTGGGGCTGCTGGCAC^ 

CTGGTGCCACAGCCTGCTACCTGCCTGCTG/^CCGCTGCAACTACCAGACTTTCTGTGCTGATGGAGCAGATGAGA 

GACGCTGTCGGCATTGCCAGCCTGGCaATTTCCGATGCCGGGACGAGAAGTGCGTGTATGAQACGTGGGTGTGCG 

ATGGGCAGCCAGACTGTGCGGACGGCAGTGiSLTGAGTGGGACTGCTCCTATGTTCTGCCCCGCAAGGTCATTACAG 

CTGCAGTCATTGGCAGCCTAGTGTGCGGCCTGCTCCTGGTCATCGCCCTGGGCTGCACCTGCAAGCTCTATGCCA 

TTCGCACCCAGGAGTAa^GCATCTTTGCCCCCCTCTCCCGGATGGAGGCTGAGATTGTGCAGCAGCAGGCACCCC 

CTTCCTACGGGCaGCTCATTGCCCAGGGTGCCATCCCyVCCTGTAGAAGACTTTCCTACAGAGAATCCTAATGATA 

ACTCAGTGCTGGGCaACCTGCGTTCTOTGCTACAGATCTTACGCaVGQATATGACTCCaGGAGGTGGCCaAG^^ 

CCCGCCGTCGTCAGCGGGGCCGCTTGATGCGACGCCTGGTACGCCGTCTCCGCCGCTGGGGCTTGCTCCCTCGAA 

CCAACACCCCGGCTCGGGCCTCTGAGGCCAGATCCCAGGTCACACCTTCTGCTGCTCCCCTTGAGGCCCTAGATG 

GTGGCACAGGTCCAGCCCGTGAGGGCGGGGCiaLGTGGGTGGGCAAGATGGGGAGCAGGCACCCCCACTGCCCATCA 

AGGCTCCCCTCCCATCTGCTAGCACGTCTCCAGCCCCCACTAOTGTCCCTGAAGCCCCAGGGCCACTGCCCTCAC 

TGCCCCTAGAGCaVTCTlCTATTGTCTGGAGTGGTGCAGGCCCTGCQAGGCCGCCTGTTGCCCAGCCTGGGGCCCC 

C::A.GGACC:SU^CCCGGAGCCCCCCTGGACCCa^CACAGCAGTCCrGGCCCTGQAAGATGAGGACGATGTGCTAC^^ 

TGCCACTGGCTGAGCCGGGGGTGTGGGTAGCTGAGGCAGAGGATGAGCCACTGCTTACCT^GGGGACCTGGGGG 

CTCTACTGAGGCCTCTCCCCTGGGGGCTCT7tCTCATAGTGGCACAACCTTTTAQAGGTGGGTCAGCCTCCCCTCC 

ACCACTTCCTTCCCTGTCCCTGGATTTCAGCX3ACTTGGTGGGCCTCCCGTTGACCCTATGTAGCTGCTATAAAGT 

TAAGTGTCCCTa^GGCAGGGAGAGGGCTCAC:AGAGTCTCCTCTGTACGTGGCCATGGCCAGACACCCCAGTCCCT 

TCACCACCaCCTGCTCCCCACGCCSVCCACaOTTGGGTGGCTGTTTTTAAAAAGTAAAGTTCTTAGAGGATCATA 

GGTCTGGACACTCCATCCTTGCCAAACCTCTACCCAAAAGTGGCCTTAAGCACCGQAATGCCAATTAACTAGAGA 

CCCTCCAGCCCCCaAGGGGAGGATTTGGGCTiGAACCTGAGGTTTTGCCATCCACAATCCCTCCTACAGGGCCTGG 

CTCACAAAAAGAGTGCAACAAATGCTTCTATTCCATAGCTACGGCATTGCTCAGTAAGTTGAGGTCAAAAATAAA 

GGAATCATACATCTC 



FIGURE 68 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49631 
<subunit 1 of 1, 713 aa, 1 stop 
<MW: 76193, pi: 5.42, NX(S/T) : 4 

MLLATLLLLLLGGALAHPDRIIFPNHACEDPPAVLLEVQGTLQRPLVRDSRTSPANCTWLIL 
GSKEQTVTIRFQKLHLACGSERLTLRSPLQPLISLCEAPPSPLQLPGGNVTITYSYAGARAP 
MGQGFLLSYSQDWLMCLQEEFQCLMHRCVSAVQRCDGVDACGDGSDEAGCSSDPFPGLTPRP 
VPSLPCNVTLEDFYGVFSSPGYTHLASVSHPQSCHWLLDPHDGRRLAVRFTALDLGFGDAVH 
VYDGPGPPESSRLLRSLTHFSNGKAVTVETLSGQAWSYHTVAWSNGRGFNATYHVRGYCLP 
WDRPCGLGSGLGAGEGLGERCYSEAQRCDGSWDCADGTDEEDCPGCPPGHFPCGAAGTSGAT 
ACYLPADRC]SrYQTFCADGADERRCKHCQPGNFRCRDEKCVYETWCDGQPD<IM)GSDEWDCS 
YVLPRKVITAAVIGSLVCGLLLVIALGCTCKLYAIRTQEYSIFAPLSRMEAEIVQQQAPPSY 
GQLIAQGAIPPVEDFPTENPNIDNSVLGNLRSLLQILRQDMTPGGGPGARRRQRGRLMRRLVR 
RLRRWGLLPRTNTPARASEARSQVTPSAAPLEALDGGTGPAREGGAVGGQDGEQAPPLPIKA 
PLPSASTSPAPTTVPEAPGPLPSLPLEPSLLSGWQALRGRLLPSLGPPGPTRSPPGPHTAV 
LALEDEDDVLLVPLAEPGVWVAEAEDEPLLT 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 442-462 

LDL-receptor class A (LDLRA) domain proteins 
amino acids 411-431, 152-171, 331-350 and 374-393 



FIGURE 



CGAGCTGGGCGAGAAGTAGGGGAGGGCGGTGCTCCGCCGCGGTGGCGGTTGCTATCGCTTCG 
CAGAACCTACTCAGGCAGCCAGCTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCA 
GACGCGATGGATAACGTGCAGCCGA?^TAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGG 
CCACGTGAAGATGCTGCGGCTGGCACTAACTGTGACATCTATGACCTTTTTTATCATCGCAC 
AAGCCCCTGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATA 
CTTTTATATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATAT 
TATCAACTCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAG 
AAACCACAACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTCTT 
GCCGACGGGGCCCTTATTTACCGGA?IGCTTCTGTTCAATCCCAGCGGTCCTTACCAGAAAAA 
GCCTGTGCATGAAAAAAAAGAAGTTTTGTAATTTTATATTACTTTTTAGTTTGATACTAAGT 
ATTAAACATATTTCTGTATTCTTCCJiAAAAAAAAAAAAAAAAA 



></usr/seqdb2/sst/DNA/Dnasegs .tnin/ss .DNA49645 
xsiibunit 1 of 1, 152 aa, 1 stop 
xMW: 17170, pi: 9.62, NX(S/T): 1 

i^]wqpkikhrpfcfsvkghvkmlr:laltvtsmtffiiaqapepyivitgfevtvilffill 
yvlrldrlmkwlfwplldi inslvt-icvfiyelivsvlali petttltvgggvfalvtavcclad 
galiyrkllfnpsgpyqkkpvhekkevl 

Important features: 

Potential type II transmenibreuie domain: 

amino acids 26-42 

Other potential transmembrane domain: 

amino acids 44-65, 81-101 and 109-129 

Leucine zipper pattern 

amino acids 78-99 and 85-106 

N-myristoylation site. 

amino acids 110-115 

Ribonucleotide reductase large subunit protein 

amino acids 116-127 



GGGCGAGAAGTAGGGGAGGGCGTGTTCCGCCGCGGTGGCGGTTGCTATCGTTTTGCAGAACC 
TACTCAGGCAGCCAGNTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCAGACGCGA 
TGGATAACGTGCAGCCGAAAATAAAi?^CATCGCCCCTTCTGCTTCAGTGTGAAAGGCCACGTG 
AAGATGCTGCGGCTGGCACTAACTGNGACATCTATGACCTTTTTTATNATCGCACAAGCCCC 
TGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATACTTTTAT 
ATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATATTATCAAC 
TCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAGAAACCAC 
AACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTNTTGCCGAC 



CAGCCCCGCGCGCCGGCCGAGTCGCTGAGCCGCGGCTGCCGGACGGGACGGGACCGGCTAGG 
CTGGGCGCGCCCCCCGGGCCCCGCCGTGGGCATGGGCGCACTGGCCCGGGCGCTGCTGCTGC 
CTCTGCTGGCCCAGTGGCTCCTGCGCGCCGCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTG 
CCCCTCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCC 
TGCCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 
GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTAC 
CTGGAGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAG 
TAACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 
CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACG 
GGCTTCGTTGGGGAAGACCTCGTCA.CCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAA 
CATTGCCACTATTTTTGAATCAGAG;?y^TTTCTTTTTGCCTGGGATTAAATGGAATGGAATAC 
TTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCC 
CTGGTGACACAAGCAAACATCCCCA?^CGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCC 
CGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGT 
ATjyyiGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTG 
AAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGC 
CATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG6 
AAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAG 
CTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATCTACCT 
GAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTACATTCAGC 
CCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCACAAAT 
GCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAA 
GAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTT 
CCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAG 
CCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTT 
AATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCA 
ATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATAGCCAGGCCTGACCTCAAGCAA 
CCATGAACTCAGCTATTAAGAAAAT(3ft.CATTTCCAGGGCAGCAGCCGGGATCGATGGTGGCG 
CTTTCTCCTGTGCCCACCCGTCTTCi^TCTCTGTTCTGCTCCCAGATGCCTTCTAGATTCAC 
TGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCCCTACTTCCAAGAAAAATAATTAA 
AAAAAAAACTTCATTCTAA 



FIGURE 73 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45493 
xsubunit 1 of 1, 518 aa, 1 stop 
xMW: 56180, pi: 5.08, NX(S/T): 2 

MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGLAL 
ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS 
YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENF 
FLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGS 
LVLGGIEPSLYKGDIWYTPIKEEWYTrQIEILKLEIGGQSLNLDCREYHADKAIVDSGTTLLR 
LPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFR 
ITILPQLYIQPMMGAGLNYEClfRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCA 
EIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRC 
QRRPRDPEWNDES SLVRHRWK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 466-494 

N-glycosylation sites. 

amino acids 170-173 and 366-369 

Leucine zipper pattern.. 

amino acids 10-31 and 197-118 

Eukaryotic and viral aspartyl proteases 

amino acids 109-118, 252-261 and 298-310 



FIGURE 74 

CGCCTCCGCCTTCGGAGGCTGACGCGCCCGGGCGCCGTTCCAGGCCTGTGCAGGGCGGATCG 
GCAGCCGCCTGGCGGCGATCCAGGGCGGTGCGGGGCCTGGGCGGGAGCCGGGAGGCGCGGCC 
GGCATGGAGGCGCTGCTGCTGGGCGCGGGGTTGCTGCTGGGCGCTTACGTGCTTGTCTACTA 
CAACCTGGTGAAGGCCCCGCCGTGCGGCGGCATGGGCAACCTGCGGGGCCGCACGGCCGTGG 
TCACGGGCGCCAACAGCGGCATCGGAAAGATGACGGCGCTGGAGCTGGCGCGCCGGGGAGCG 
CGCGTGGTGCTGGCCTGCCGCAGCCAGGAGCGCGGGGAGGCGGCTGCCTTCGACCTCCGCCA 
GGAGAGTGGGAACAATGAGGTCATCTTCATGGCCTTGGACTTGGCCAGTCTGGCCTCGGTGC 
GGGCCTTTGCCACTGCCTTTCTGAGCTCTGAGCCACGGTTGGACATCCTCATCCACAATGCC 
GGTATCAGTTCCTGTGGCCGGACCCGTGAGGCGTTTAACCTGCTGCTTCGGGTGAACCATAT 
CGGTCCCTTTCTGCTGACACATCTGCTGCTGCCTTGCCTGAAGGCATGTGCCCCTAGCCGCG 
TGGTGGTGGTAGCCTCAGCTGCCCAGTGTCGGGGACGTCTTGACTTCAAACGCCTGGACCGC 
CCAGTGGTGGGCTGGCGGCAGGAGCTGCGGGCATATGCTGACACTAAGCTGGCTAATGTACT 
GTTTGCCCGGGAGCTCGCCAACCAGGTTGAGGCCACTGGCGTCACCTGCTATGCAGCCCACC 
CAGGGCCTGTGAACTCGGAGCTGTTCCTGCGCCATGTTCCTGGATGGCTGCGCCCACTTTTG 
CGCCCATTGGCTTGGCTGGTGCTCC(3GGCACCAAGAGGGGGTGCCCAGACACCCCTGTATTG 
TGCTCTACAAGAGGGCATCGAGCCCGTCAGTGGGAGATATTTTGCCAACTGCCATGTGGAAG 
AGGTGCCTCCAGCTGCCCGAGACGACCGGGCAGCCCATCGGCTATGGGAGGCCAGCAAGAGG 
CTGGCAGGGCTTGGGCCTGGGGAGGATGCTGAACCCGATGAAGACCCCCAGTCTGAGGACTC 
AGAGGCCCCATCTTCTCTAAGCACCCCCCACCCTGAGGAGCCCACAGTTTCTCAACCTTACC 
CCAGCCCTCAGAGCTCACCAGATTTGTCTAAGATGACGCACCGAATTCAGGCTAAAGTTGAG 
CCTGAGATCCAGCTCTCCTAACCCTCAGGCCAGGATGCTTGCCATGGCACTTCATGGTCCTT 
GAAAACCTCGGATGTGTGTGAGGCCATGCCCTGGACACTGACGGGTTTGTGATCTTGACCTC 
CGTGGTTACTTTCTGGGGCCCCAAGCTGTGCCCTGGACATCTCTTTTCCTGGTTGAAGGAAT 
AATGGGTGATTATTTCTTCCTGAGAGTGACAGTAACCCCAGATGGAGAGATAGGGGTATGCT 
AGACACTGTGCTTCTCGGAAATTTGGATGTAGTATTTTCAGGCCCCACCCTTATTGATTCTG 
ATCAGCTCTGGAGCAGAGGCAGGGAGTTTGCAATGTGATGCACTGCCAACATTGAGAATTAG 
TGAACTGATCCCTTTGCAACCGTCTAGCTAGGTAGTTAAATTACCCCCATGTTAATGAAGCG 
GAATTAGGCTCCCGAGCTAAGGGACTCGCCTAGGGTCTCACAGTGAGTAGGAGGAGGGCCTG 
GGATCTGAACCCAAGGGTCTGAGGCCAGGGCCGACTGCCGTAAGATGGGTGCTGAGAAGTGA 
GTCAGGGCAGGGCAGCTGGTATCGAGGTGCCCCATGGGAGTAAGGGGACGCCTTCCGGGCGG 
ATGCAGGGCTGGGGTCATCTGTATCTGAAGCCCCTCGGAATAAAGCGCGTTGACCGCCAAAA 
AAAAAAAAAAAAAAAAA 



</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48227 
<subunit 1 of 1, 377 aa, I stop 
<MW: 40849, pi: 7.98, NX{S/T): 0 

mealllgaglllgayvlvyynlvkappcggmgnlrgrtawtgansgigkmtalelarrgar 

wlacrsqergeaaafdlrqesgnnevifmaldlaslasvrafataflsseprldilihnag 

isscgrtreafnlllrvnhigpfllthlllpclkacapsrwwasaahcrgrldfkrldrp 

wgwrqelrayadtkijywlfarek?mqleatgvtcyaahpgpvnselflrhvpgwlr^ 

plawlvlraprggaqtplycalqeg3:eplsgryfanchveevppaarddraahrlweaskrl 

aglgpgedaepdedpqsedseapsslstphpeeptvsqpypspqsspdlskmthriqakvep 

EIQLS 

Important features: 
Signal peptide: 
amino acids 1-16 

Glycosaminoglycan attachment site. 

amino acids 46-49 

Short -chain alcohol dehydrogenase family 

amino acids 37-49 and 114-124 



FIGURE 76 



GGAGGAGACAGCCTCCTGGGGGGCAGGGGTTCCCTGCCTCTGCTGCTCCTGCTCATCATGGGAGG 

GACTCCCCGCCCCAGATCCTAGTCCACCCCcmGGACCAGCTGTTCCAGGGCCCTGGCCCTGCCAGGATGAGCTGC 

CAAGCCTCy^GGCCAGCCACOTCCCACCATCCGCTGGTTGCTGAATGGGCAGCCCCTGAGCaTGGTGCCCCCA 

CCAC:ACC:a.CCTCCTGCCTQATGGGACCCTTCTGCTGCTACaGCCCCCTGCCCGGGGACa.TGCCCACGATGGCCm^ 

GCCCTGTCCACaGACCTGGGTGTCTACACATGTGAGGCCAGCT^CCGGCTTGGCACGGCAGTCAGCAGAGGC^ 

CGGCTGTCTGTG6CTGTCCTCCGGGAGGATTTCCA6ATCCAGCCTCGGGACATGGTGGCTGTGGTGGGTQAGCAG 

TTTACTCTGGAATGTGGGCCGCCCTGGGGCC:!ACCCAGAGCCCACAGTCTCATGGTGGAAAGATGGGAAACCCCTG 

GCCCTCCAGCCCGGAAGGCACACAGTGTCCC5GGGGGTCCCTGCTGATGGCAAQAGCAGAGAAGAGTGACGAAGGG 

ACCTACATGTGTGTGGCOVCCAACAGCGCAC^CATAGGGAGAGCCGCGCAGCCCGGGTTTCCATCCAGGAGCCC 

CAGGACTACACGGAGCCTGTGGAGCTTCTGCJCTGTGCGAATTCAGCTGGAAAATGTGACACTGCTGAACCCGGAT 

CCTGCAGAGGGCCCCAAGCCTAGACCGGCGC5TGTGGCTCAGCTGGAAGGTCAGTGGCCCTGCTGCGCCTGCCCAA 

TCTTACACGGCCTTGTTCAGGACCCAGACTGCCCCGGQAGGCCAGGGAGCTCCGTGGGCAGAGGAGCTGCTGGCC 

ggctggcagagcgcagagcttggaggcctcc:actggggccaagactacgagttcaaagtgagaccatcctctggc 

CGGGCTCGAGGCCCTGACAGCAAC6TGCTGCTCCTGAGGCTGCCGGAAAAAGTGCCCAGTGCCCCACCTCAGGAA 

gtgactctaaagcctggcaatggcactgtctttgtgagctgggtcccaccacctgctgaaaaccacaatggcatc 

ATCCGTGGCTACCAGGTCTGGAGCCTGGGOIAa^CATCACTGCCACCAGCCAACTGGACTGTAGTTGGTGAGC^ 

ACCCy^GCTGGAAATCGCCaCCCATATGCCAGIGCTCCTACTGCGTGCAAGTGGCTGCAGTCACTGGTGCTGGAGCT 

GGGGAGCCC^GTAGACCTGTCTGCCTCCTTl'TAGAGCAGGCCATGGAGCGAGCCACCCAAGAACCCAGTGAGCAT 

GGTCCCTGGACCCTGGAGCAGCTQAGGGCTACCTTGAAGCGGCCTGAGGTCATTGCCACCTGCGGTGTTGCACTC 

TGGCTGCTGCTTCTGGGCACCGCCGTGTGTATCCACCGCCGGCGCCQAGCTAGGGTGCACCTGGGCCCAGGTCTG 

TAOlGATATACCAGTQAGGATGCCATCCTAaAACACAGGATGGATCACAGTGACTCCCAGTGGTTGGC^ 

TGGCGTTCCACCTCTGGCTCTCGGGACCTGa.GCAGCAGCAGCAGCCTCAGCAGTCGGCTGGGGGCGGATGCCCGG 

GACCCACTAGACTGTCGTCGCTCCTTGCTCTCCTGGGACTCCCGAAGCCCCGGCGTGCCCCTGCTTCCAGACACC 

agcactttttatggctccctcatcgctgagc:tgccctccagtaccccagccaggccaagtccccaggtcccagct 

GTCAGGCGCCTCCCACCCCAGCTGGCCCAGCTCTCCAGCCCCTGTTCCAGCTCAGACAGCCTCTGCAGCCGCAGG 

ggactctcttctccccgcttgtctctggcccctgcagaggcttggaaggccaaaaagaagcaggagctgcagcat 

GCCAACAGTTCCCCACTGCTCCGGGGCAGCCACTCCTTGGAGCTCCGGGCCTGTGAGTTAGGAAATAGAGGTTCC 

aagaacctttcccaaagcccaggagctgtgccccaagctotggttgcctggcgggccctgggaccgaaactcctc 

AGCTCCTCAAATGAGCTGGTTACTCGTCATCTCCCTCCAGCACCCCTCTTTCCTCATGAAACTCCCCCAACTCAG 

AGTC^ACAGACCaiGCCTCCGGTGGCACCACAGGCTCCCTCCTCCATCCTGCTGCCAGCAGCCCCCATCCCCATC 

CTTAGCCCCTGCAGTCCCCCTAGCCCCCAGGCCTCTTCCCTCTCTGGCCCCAGCCCAGCTTCCAGTCGCCTGTCC 

AGCTCCTCACTGTCATCCCTGGGGGAGQATCAAGACAGCGTGCTGACCCCTGAGGAGGTAGCCCTGTGCTTGGAA 

CTCAGTQAGGGTGAGGAGACTCCCAGGAACAGCGTCTCTCCCATGCCAAGGGCTCCTTCACCCCCCACCACCTAT 

GGGTACATCAGCGTCCCAACAGCCTCAGAGTTCACGGACATGGGCAGGACTGGAGGAGGGGTGGGGCCCAAGGGG 

GGAGTCTTGCTGTGCCCACCTCGGCCCTGCCTCACCCCCACCCCCAGCGAGGGCTCCTTAGCCAATGGTTGGGGC 

TCAGCCTCTGAGGAG?^TGCCGCCaGCGCCa.GAGCCAGCCTTGTCAGCTCCTCCGATGGCTCCTTCCTCGCTGAT 

GCTCACTTTGCCCGGGCCCTGGCAGTGGCTGTGGATAGCTTTGGTTTCGGTCTAGAGCCCAGGGAGGCAGACTGC 

GTCTTCATAGATGCCTCATCACCTCCCTCCCCACGGGATGAGATCTTCCTGACCCCCAACCTCTCCCTGCCCCTG 

TGGGAGTGGAGGCCAGACTGGTTGGAAGACATGGAGGTCAGCCACACCCAGCGGCTGGGAAGGGGGATGCCTCCC 

TGGCCCCCTGACTCTCAGATCTCTTCCCAGAQAAGTCAGCTCCACTGTCGTATGCCCAAGGCTGGTGCTTCTCCT 

GTAGATTACTCCTGAACCGTGTCCCTGAGACTTCCCAGACGGGAATCAGAACCACTTCTCCTGTCCACCCAa^ 

ACCTGGGCTGTG6TGTGTGGGTCTTGGCCTGTGTTTCTCTGCAGCTGGGGTCCACCTTCCCAAGCCTCCAGAGAG 

TTCTCCCTCCACGATTGTGAAAACAAATGAA!UICAAAATTAGAGC»AAGCTGACCTGGAGCCCTCAGGGAGC^ 

ACATCATCTCCACCTGACTCCTAGCCACTGCTTTCTCCTCTGTGCCATCCACTCCCACCACCAGGTTGTTTTGGC 

CTGAGGAGCAGCCCTGCCTGCTGCTCTTCCCCCACCATTTGGATCACAGGAAGTGGAGGAGCCAGAGGTGCCTTT 

GTGGAGGACAGCAGTGGCTGCTGGGAGAGGGCTGTGGAGGAAGGAGCTTCTCGGAGCCCCCTCTCAGCCTTACCT 

GGGCCCCTCCTCTAGAGAAGAGCTCAACTCTCTCCCAACCTCACCATGGAAAGAAAATAATTATGAATGCCACTG 

AGGCACTGAGGCCCTACCTCATGCCAAACAAi^GGGTTaiAGGCTGGGTCTAGCGAGGATGCTGAAGGAAGG^ 

TATGAGACCGTAGGTCAAAAGCACCATCCTCGTACTGTTGTCACTATGAGCTTAAGAAATTTGATACCATAAAAT 

GGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAiU^AAAAAAAA 



FIGURE 77 



< /us r / s eqdb 2 / s s t /DNA/Dna segs.min/ss- DNA4 1404 
<siibunit 1 of 1, 985 aa, 1 stop 
<MW: 105336, pi: 6.55, NX.(S/T): 7 

MGGMAQDSPPQ I LVHPQDQLFQGPGPARMS CQASGQPPPTIRWLLNGQPLSMVPPDPHHLLP 
DGTLLLLQPPARGHAHDGQALSTDIiGVYTCEASNRLGTAVSRGARLSVAVLREDFQIQPRDM 
VAWGEQFTLECGPPWGHPEPTVSRfWKDGKPLALQPGRHTVSGGSLLMARAEKSDEGTYMCV 
ATNSAGHRESRAARVSIQEPQDYTEPVELLAVRIQLENVTLLNPDPAEGPKPRPAVWLSWKV 
SGPAAPAQSYTALFRTQTAPGGQGAPWAEELLAGWQSAELGGLHWGQDYEFKVRPSSGRARG 
PDSNVLLLRLPEKVPSAPPQEVTLPCPGNGTVFVSWVPPPAENHNGIIRGYQVWSLGNTSLPP 
ANWTWGEQTQLEIATHMPGSYCVQVAAVTGAGAGEPSRPVCLLLEQAMERATQEPSEHGPW 
TLEQLRATLKRPEVIATCGVALWLLLLGTAVCIHRRRRARVHLGPGLYRYTSEDAILKHRMD 
HSDSQWLADTWRSTSGSRDLSSSSSLSSRLGADARDPLDCRRSLLSWDSRSPGVPLLPDTST 
FYGSLIAELPSSTPARPSPQVPAVRRLPPQLAQLSS PCS SSDSLCSRRGLS S PRLSLAPAEA 
WKAKKKQELQHANSSPLLRGSHSLEILRACELGNRGSKNLSQSPGAVPQALVAWRALGPKLLS 
SSNELVTRHLPPAPLFPHETPPTQSQQTQPPVAPQAPSSILLPAAPIPILSPCSPPSPQASS 
LSGPSPASSRLSSSSLSSLGEDQDSVLTPEEVALCLELSEGEETPRNSVSPMPRAPSPPTTY 
GYISVPTASEFTDMGRTGGGVGPKGGVLIiCPPRPCLTPTPSEGSLANGWGSASEDNAASARA 
SLVSSSDGSFLADAHFARALAVAVDSFGFGLEPREADCVFIDASSPPSPRDEIFLTPNLSLP 
LWEWRPDWLEDMEVSHTQRLGRGMPPWPPDSQISSQRSQIiHCRMPKAGASPVDYS 

Important features: 
Transmembrane domain: 
amino acids 448-467 
N-glycosylation sites: 

amino acids 224-227, 338-341, 367-370, 374-377, 658-661 and 926- 
929 

N-myristoylation sites. 

amino acids 47-52, 80-85, 88-93, 99-104, 105-110, 181-186, 272- 
277, 290-295, 355-360, 403-408, 462-467, 561-566, 652-657, 849- 
854 and 876-881 

Phospho tyro sine interaction domain proteins 

amino acids 740-753 



FIGURE 78 



CTCCCACGGTGTCCAGCGCCCAG Aj^TG CGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCT 
CCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGT 
CCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGT 
GGGATCCTCTTCTCTCGCTGCTCTCK5CACCATCTATGCAGAAGAAGAAGGCCAGGAGACAAT 
GAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGA 
ACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAG 
TCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCAC 
CTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCC 
CAGGATTGACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGG 
GCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACAC 
AGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGC 
TGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGG 
GTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGC 
CGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCA 
CGGAGACACAGAGGAACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCC 
CCTTCCCAGGCCCCTGAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGA 
GCTGGGCTTCTCGAAQTTTGTCTCAGC GTAGG GCAGGAGGCCCTCCTGGCCAGGCCAGCAGT 
GAAGCAGTATGGCTGGCTGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAG 
TCCAGCTGCCCGGACTCCAGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCA 
GCCTGACCTAGAAGCGTTTGTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTG 
GAGACTG6GACATCCCTGATAGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCA 
GCAGGGCCAGACAAGGCTCAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGC 
CTCATGCCCAGTGTCGGACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCC 
TGGCGTCCTCAGACTTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCT 
GGGGTGAGACTGGGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCT 
GTGAAAAACGTGATTCCTGGCCCCACCl^GACCCACCAAAACCATCTCTGGGCTTGGTGCAG 
GACTCTGAATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTG 
ATGAACGCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCC 
CAATAGATCTGCTCTGTCT6CGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAG 
TCCAGGCCTTGGTCAGGTCAGGTGGkCATTGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 
TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAAGA 
AAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTAGAGA 
GAAAAGTAGAAAACCAGAGTGCACGTAGGTGTCTAACACAGAGGAGAGTAGGAACAGGGCGG 
ATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTGGTAAAGTA 

GCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGGGTTCAAGTGATT 
CTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACCACACCTGGCTAATT 
TTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGAC 
CTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTACAGGCATGAGCCACTGTG 
TCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTCAGTATGCAAAACTTGGAAAG 
ATGGAGGAGAAAAAGAAAAGGAAGA^kAAAAATGTCACCCATAGTCTCACCAGAGACTATCAT 
TATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTCACATAATTTGCCGGTGTTCTT 
TTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGCCTTTTCCACCTTATCGTTCC 
ATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTTATAAATAAAATGTTCATCA 
GCTGCATAAAAAAAAAAAAAA 



FIGURE 79 



</usr/seqdb2/sst/DNA/Dnasegs .min/ss .DNA44196 
<subunit 1 of 1, 332 aa, 1 stop 
<MW: 36143, pi: 5.89, NX{S/T) ; 1 

MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREEIiRDHRKYWCRKGGILFSRCS 
GTIYAEEEGQETMKGRVSIRDSRQEljSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFV 
FPGPCCPPSPSPTFQPLATTRLQPiQ^KAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPG 
TSQYGHERTSQYTGTSPHPATSPPA(3SSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRI 
LAPVLVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGD 
VISMPPLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 248-269 

N-glycosylation site. 

amino acids 96-99 

Fibrinogen beta and gamma chains C- terminal domain. 

amino acids 104-113 



Ig like V-type domain: 

amino acids 13-128 



FIGURE 80 



TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGA 
GCCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTC 
GCTCTGTTGCCCAGGCTAGAGTGTAGTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCT 
GGACTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAA 
GCCCTGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAA 
AGCTCAGCTCTGAGCCAGAGTGGTGC3TGGCTCCACCTCTGCCGCCGGCATAGAAGCCAGGAG 
CAGGGCTCTCAGAAGGCGGTGGTGCiCCAGCTGGGAT CATGT TGTTGGCCCTGGTCTGTCTGC 
TCAGCTGCCTGCTACCCTCCAGTGAC5GCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTG 
CTACATGACTTCGGGCTGGACGGATJ^CCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGC 
TTATTTCACAAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACA 
ACGGGATCTTCCAGATCAACAGCCGGfAGGTGGTGCAGCAACCTCACCCCGAACGTCCCCAAC 
GTGTGCCGGATGTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGC 
CATGAAGATAACCCAAGAGCCTCAGGGTCTGGGTTACTG6GAGGCCTGGAGGCATCACTGCC 
AGGGAAAAGACCTCACTGAATGGGTG'rGATGGCTGTGACTT CTAGG ATGGACGGAACCATGCA 
CAGCAGGCTGGGAAATGTGGTTTGGI^TCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATA 
AAGGATGGTTGAACGTGAAA 



FIGURE 81 



</usr/seqdb2/sst/DNA/Diiasegs.inin/ss.DNA52187 
<stibimit 1 of 1, 146 aa, 1 stop 
<MW: 16430, pi: 5.05, NX(S/T): 1 

MLLALVCLLSCLLPSSEAKLYGRCEILARVLHDFGLDGYRGYSLADWVCLAYFTSGFNAAALD 
YEADGSTNNGIFQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGY 
WEAWRHHCQGKDLTEWVDGCDF 

Important features: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 67-72 

Homolgous region to Alpha- lac talbtamin / lysozyme C proteins. 

amino acids 34-58 {catalytic domain) , 111-132 and 66-107 



FIGURE 82 



agccgctgccccgggccgggcgcccgcggcggcacc atga gtccccgctcgtgcctgcgttc 
gctgcgcctcctcgtcttcgccgtcttctcagccgccgcgagcaactggctgtacctggcca 
agctgtcgtcggtggggagcatctc:agaggaggagacgtgcgaga2^ctcaagggcctgatc 
cagaggcaggtgcagatgtgcaagcggaacctggaagtcatggactcggtgcgccgcggtgc 
ccagctggccattgaggagtgccagtaccagttccggaaccggcgctggaactgctccacac 
tcgactccttgcccgtcttcggcaaggtggtgacgcaagggactcgggaggcggccttcgtg 
tacgccatctcttcggcaggtgtggfcctttgcagtgacgcgggcgtgcagcagtggggagct 
ggagaagtgcggctgtgacaggacagtgcatggggtcagcccacagggcttccagtggtcag 
gatgctctgacaacatcgcctacggtgtggccttctcacagtcgtttgtggatgtgcgggag 
agaagcaagggggcctcgtccagcagagccctcatgaacctccacaacaatgaggccggcag 
gaaggccatcctgacacacatgcgggtggaatgcaagtgccacggggtgtcaggctcctgtg 
aggtaaagacgtgctggcgagccgtgccgcccttccgccaggtgggtcacgcactgaaggag 
aagtttgatggtgccactgaggtggagccacgccgcgtgggctcctccagggcactggtacc 
acgcaacgcacagttcaagccgcacacagatgaggacctggtgtacttggagcctagccccg 
acttctgtgagcaggacatgcgcagcggcgtgctgggcacgaggggccgcacatgcaacaag 
acgtccaaggccatcgacggctgtgagctgctgtgctgtggccgcggcttccacacggcgca 
ggtggagctggctgaacgctgcagctgcaaattccactggtgctgcttcgtcaagtgccggc 
agtgccagcggctcgtggagttgcaciacgtgccgatgaccgcctgcctagccctgcgccggc 
aaccacctagtggcccagggaaggccgataatttaaacagtctcccaccacctaccccaaga 
gatactggttgtattttttgttctggtttggtttttgggtcctcatgttatttattgccgaa 
accaggcaggcaaccccaagggcacc'aaccagggcctccccaaagcctgggcctttgtggct 
gccactgaccaaagggaccttgctcgtgccgctggctgcccgcatgtggctgccactgacca 
ctcagttgttatctgtgtccgtttttctacttgcagacctaaggtggagtaacaaggagtat 
taccaccacatggctactgaccgtgtcatcggggaagagggggccttatggcagggaaaata 
ggtaccgacttgatggaagtcacacgctctggaaaaaagaactcttaactctccagcacaca 
tacacatggactcctggcagcttgagtcctagaagccatgtctctcaaatgccctgagaaagg 
gaacaagcagataccaggtcaagggcaccaggttcatttcagcccttacatggacagctaga 
ggttcgatatctgtgggtccttccaggcaagaagagggagatgagagcaagagacgactgaa 
gtcccaccctagaacccagcctgccccagcctgcccctgggaagaggaaacttaaccactcc 
ccagacccacctaggcaggcatataggctgccatcctggaccagggatcccggctgtgcctt 
tgcagtcatgcccgagtcacctttca.cagcgctgttcctccatgaaactgaaaaacacacac 
acacacacacacacacacacacacacacacacacacacggacacacacacacacctgcgaga 
gagagggaggaaagggctgtgcctttgcagtcatgcccgagtcacctttcacagcactgttcctc 



FIGURE 83 



< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA4 8328 
<siibunit 1 of 1, 351 aa, 1 stop 
<MW: 39052, pi: 8.97, NX(S/T): 2 

MSPRSCLRSLRLLVFAVFSAAASNWrLYLAKLSSVGSISEEETCEKLKGLIQRQVQMCKRNLE 
VIVDSWRGAQLAIEECQYQFRMRRVINCSTLDSLPVFGKVVTQGTREAAFVYAISSAGVAFAV 
TRACSSGELBKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRERSKGASSSRALM 
NLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDGATEVEPRR 
VGSSRALVPKNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGRTCNKTSKAIDGCELLC 
CGRGFHTAQVELAERCSCKFHWCCF7KCRQCQRLVELHTCR 

Important features: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 88-91 and 297--300 

Wnt-1 family signature. 

amino acids 206-215 

Homologous region to Wnt-1 family proteins 

amino acids 183-235, 305-350, 97-138, 53-92 and 150 -174 



FIGURE 84 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCAT 
CGCCATGGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGC 
CCTGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTC 
ACCACAGTCCTTTGGGCTGTGATTCITGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGC 
GGCGCTGCTTGACGGCCACGACCTGCTGAGGACAAACGCCTCGAAGCAGACGGCGGCGCTGG 
GTGCCCTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTG 
CAGACCACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCT 
GCGGGAACTGCGT6AGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACG 
TCCGCACTGAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCG 
TGCCCCACGTCGTGGCTGTCCTTCGAGGGCTCCT6CTACTTTTTCTCTGTGCCAAAGACGAC 
GTGGGCGGCGGCGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCC 
TGGATGAGCAGGGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGG 
GCTGTGCGCCATCTGGGCAAGGTTCi^GGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTT 
CAGCCACTGGAACCAGGGAGAGCCGfy^TGACGCTTGGGGGCGCGAGAACTGTGTCATGATGC 
TGCACACGGGGCTGTGGAACGACGa\CCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAG 
AAAAGGCACAACTGCTGACCCCGCCCAGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTA 
TCCTGGGGGCTGCTCACCTCCCTGGCTCCTGGAGCTGATTGCCAAAGAGTTTTTTTCTTCCT 
CATCCACCGCTGCTGAGTCTCAGAA?kCACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCC 
TGGGCTCTGGGACCTCCATGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACC 
TCCACTAGCTCCAAAATCCCTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAA 
CCAAGGTTAGGTGACTGAGGACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGA 
AGCTGTTTTTGCAGCCTGAGGAAGCArCAATAAATATTTGAGAAATGAAAAAA 



FIGURE 85 



< /us r / seqdb2 / s s t /DNA/Dnas eqs . mi n / s s . DNA5 6 3 5 2 
<subunit 1 of 1, 293 aa, 1 stop 
<MW: 32562, pi: 6.53, NX(S/T): 2 

MDTTRYSKWGGSS EEVPGGPWGRWHWSRRPLFLALAVLVTTVLWAVI LS ILLSKASTERAA 
LLDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKLMEQESALR 
ELRERVTQGLAEAGRGREDVRTELFl^LEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTW 
AAAQDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFS 
HWNQGEPNDAWGRENCVMMLHTGLTOIDAPCDSEKDGWICEKRHNC 

Important features: 

Type II transmembrane domain: 

amino acids 31-54 

N-glycosylation sites. 

amino acids 73-76 and 159-162 

Leucine zipper pattern. 

amino acids 102-123 

N-myristoylation sites. 

amino acids 18-23, 133-138 and 242-247 

C-type lectin domain signature. 

amino acids 264-287 



FIGURE 86 



gccaggggaagagggtgatccgacccggggaaggtcgctgggcagggcgagttgggaaagcg 
gcagcccccgccgcccccgcagccc:cttctcctcctttctcccacgtcctatctgcctctcg 
ctggaggccaggccgtgcagcatcgaagacaggaggaactggagcctcattggccggcccgg 
ggcgccggcctcgggcttaaataggfagctccgggctctggctgggacccgaccgctgccggc 
cgcgctcccgctgctcctgccgggtgatggaaaaccccagcccggccgccgccctgggcaag 
gccctctgcgctctcctcctggccactctcggcgccgccggccagcctcttgggggagagtc 
catctgttccgccagagccccggccaaatacagcatcaccttcacgggcaagtggagccaga 
cggccttccccaagcagtaccccctgttccgcccccctgcgcagtggtcttcgctgctgggg 
gccgcgcatagctccgactacagcatgtggaggaagaaccagtacgtcagtaacgggctgcg 
cgactttgcggagcgcggcgaggcctgggcgctgatgaaggagatcgaggcggcgggggagg 
cgctgcagagcgtgcacgaggtgttttcggcgcccgccgtccccagcggcaccgggcagacg 
tcggcggagctggaggtgcagcgcaggcactcgctggtctcgtttgtggtgcgcatcgtgcc 
cagccccgactggttcgtgggcgtggacagcctggacctgtgcgacggggaccgttggcggg 
aacaggcggcgctggacctgtacccctacgacgccgggacggacagcggcttcaccttctcc 
tcccccaacttcgccaccatcccga?lggacacggtgaccgagataacgtcctcctctcccag 
ccacccggccaactccttctactacccgcggctgaaggccctgcctcccatc6ccagggtga 
cactgctgcggctgcgacagagccccagggccttcatccctcccgccccagtcctgcccagc 
agggacaatgagattgtagacagcgcctcagttccagaaacgccgctggactgcgaggtctc 
cctgtggtcgtcctggggactgtgcggaggccactgtgggaggctcgggaccaagagcagga 
ctcgctacgtccgggtccagcccgcc::aacaacgggagcccctgccccgagctcgaagaagag 
gctgagtgcgtccctgataactgcgtctaagaccagagccccgcagcccctggggccccccg 
gagccatggggtgtcgggggctcctgtgcaggctcatgctgcaggcggccgagggcacaggg 
ggtttcgcgctgctcctgaccgcggtgaggccgcgccgaccatctctgcactgaagggccct 
ctggtggccggcacgggcattgggajsacagcctcctcctttcccaaccttgcttcttagggg 
cccccgtgtcccgtctgctctcagccjtcctcctcctgcaggataaagtcatccccaaggctc 
cagctactctaaattatgtctccttataagttattgctgctccaggagattgtccttcatcg 
tccaggggcctggctcccacgtggttgcagatacctcagacctggtgctctaggctgtgctg 
agcccactctcccgagggcgcatccaagcgggggccacttgagaagtgaataaatggggcgg 
tttcggaagcgtcagtgtttccatgttatggatctctctgcgtttgaataaagactatctct 

GTTGCTCACAAAAAAAAAAAAAAAAA^AAAAAAAAAAAAAAAAAAAJ^ 



FIGURE 87 



></usr/seqdb2/sst/DNA/Dnasegs.min/ss.DNA53971 
xsubunit 1 of 1, 331 aa, 1 stop 
xMW: 35844, pi: 5.45, NXiS/T) z 2 

MENPS PAAALGKALCALLLATLGAAGQPLGGES I CSARAPAKYS ITFTGKWSQTAFPKQYPL 
FRPPAQWSSLLGAAHSSDYSMWRKINQYVSNGLRDFAERGEAWALMKEIEAAGEALQSVHEVF 
SAPAVPSGTGQTSAELEVQRRHSLVSFWRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYP 
YDAGTDSGFTFS S PNFAT IPQDTVTEITS SSPSHPANSFYYPRLKALPP I ARVTLLRLRQSP 
RAFIPPAPVLPSRDNEIVDSASVPETPLDCEVSLWSSWGLCGGHCGRLGTKSRTRYVRVQPA 
NNGSPCPELEEEAECVPDNCV 

Important features: 
Signal peptide: 

amino acids 1-26 



BlUllilt 
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GGCGGCGTCCGTGAGGGGCTCCTTTGGGCAGGGGTAGTGTTTGGTGTCCCTGTCTTGCGTGA 
TATTGACAAACTGAAGCTTTCCTGCACCACTGGACTTAAGGAAGAGTGTACTCGTAGGCGGA 
CAGCTTTAGTGGCCGGCCGGCCGCTCTCATCCCCCGTAAGGAGCAGAGTCCTTTGTACTGAC 
CAAGATGAGCAACATCTACATCCAGGAGCCTCCCACGAATGGGAAGGTTTTATTGAAAACTA 
CAGCTGGAGATATTGACATAGAGTTGTGGTCCAAAGAAGCTCCTAAAGCTTGCAGAAATTTT 
ATCCAACTTTGTTTGGAAGCTTATTATGACAATACCATTTTTCATAGAGTTGTGCGTGGTTT 
CATAGTCCAAGGCGGAGATCCTACTGGCACAGGGAGTGGTGGAGAGTCTATCTATGGAGCGC 
CATTCAAAGATGAATTTCATTCACGGTTGCGTTTTAATCGGAGAGGACTGGTTGCCATGGCA 
AATGCTGGTTCTCATGATAATGGCAGCCAGTTTTTCTTCACACTGGGTCGAGCAGATGAACT 
TAACAATAAGCATACCATCTTTGG^^GGTTACAGGGGATACAGTATATAACATGTTGCGAC 
TGTCAGAAGTAGACATTGATGATGA.CGAAAGACCACATAATCCACACAAAATAAAAAGCTGT 
GAGGTTTTGTTTAATCCTTTTGATGACATCATTCCAAGGGAAATTAAAAGGCTGAAAAAAGA 
GAAACCAGAGGAGGAAGTAAAGAAATTGAAACCCAAAGGCACAAAAAATTTTAGTTTACTTT 
CATTTGGAGAGGAAGCTGAGGAAGAAGAGGAGGAAGTAAATCGAGTTAGTCAGAGCATGAAG 
GGCAAAAGCAAAAGTAGTCATGACTTGCTTAAGGATGATCCACATCTCAGTTCTGTTCCAGT 
TGTAGAAAGTGAA2^AAGGTGATGCACCAGATTTAGTTGATGATGGAGAAGATGAAAGTGCAG 
AGCATGATGAATATATTGATGGTGATGAAAAGAACCTGATGAGAGAAAGAATTGCCAAAAAA 
TTAAAAAAGGACACAAGTGCGAATGTTAAATCAGCTGGAGAAGGAGAAGTGGAGAAGA^lATC 
AGTCAGCCGCAGTGAAGAGCTCAGAAAAGAAGCAAGACAATTAAAACGGGAACTCTTAGCAG 
CAAAACAAAAAAAAGTAGAAAATGC^GCAAAACAAGCAGAAAAAAGAAGTGAAGAGGAAGAA 

gcccctccagatggtgctgttgcc6i\atacagaagagaaaagcaaaagtatgaagctttgag 
gaagcaacagtcaaagaagggaacttcccgggaagatcagacccttgcactgctgaaccagt 
ttaaatctaaactcactcaagcaattgctgaaacacctgaaaatgacattcctgaaacagaa 
gtagaagatgatgaaggatggatgtcacatgtacttcagtttgaggataaaagcagailaagt 
gaaagatgcaagcatgcaagactcagatacatttgaaatctatgatcctcggaatccagtga 
ataaaagaaggagggaagaaagcaa^aagctgatgagagagaaaaaagaaagaag ataa aat 
gagaataatgataaccagaacttgctggaaatgtgcctacaatggccttgtaacagccattg 
ttcccaacagcatcacttaggggtgtgaaaagaagtatttttgaacctgttgtctggttttg 
aaaaacaattatcttgttttgcaaattgtggaatgatgtaagcaaatgcttttggttactgg 
tacatgtgttttttcctagctgaccitttatattgctaaatctgaaataaaataactttcct 
tccacaaaaaaaaaaaaaaaaaaaaa?y^aaaaaa 



I 



inHDiumiii 
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></usr/segdb2/sst/DNA/Dnaseqs .Tnin/ss.DNA50919 
xsubunit 1 of 1^ 472 aa, 1 stop 
xMW: 53847, pi: 5.75, NX(S/T): 2 

MSNIYIQEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRWPGFI 
VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRADELN 
NKHTI FGKVTGDTVYNMLRLS EVDIDDDERPHNPHKI KSCEVLFNPFDD 1 1 PREIKRLKKEK 
PEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEVNRVSQSMKGKSKSSHDLLKDDPHLSSVPW 
ESEKGDAPDLVDDGEDESAEHDEYir)GDEKNLMRERIAKiaiKKDTSANVKSAGEGEVEKKSV 
SRSEELRKEARQLKRELLAAKQKKVSNAAKQAEKRSEEEEAPPDGAVAEYRREKQKYEALRK 
QQSKKGTSREDQT]^LNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSRKVK 
DASMQDSDTFEIYDPRNPVNKRRREEISKKLMREKKERR 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 109-112 and 201-204 

Cyclophilin-type pept idyl -prolyl cis-trans isomerase signature. 

amino acids 49-66 

Homologous region to Cyclophilin-type pept idyl -prolyl cis-trans 
isomerase 

amino acids 96-140, 49-89 and 22-51 



FIGURE 90 



CGCCGCCGTTGGGGCTGGAAGTTCCCGCCAGGTCCGTGCCGGGCGAGAGAGATGCTGCCCGG 
CCCGCCTCGGCTTTGAGGCGAGAGAAGTGTCCCAGACCCATTTCGCCTTGCTGACGGCGTCG 
AGCCCTGGCCAGAC ATG TCCACAGGGTTCTCCTTCGGGTCCGGGACTCTGGGCTCCACCACC 
GTGGCCGCCGGCGGGACCAGCACAGGCGGCGTTTTCTCCTTCGGAACGGGAACGTCTAGCAA 
CCCTTCTGTGGGGCTCAATTTTGGAJ^ATCTTGGAAGTACTTCAACTCCAGCAACTACATCTG 
CTCCTTCAAGTGGTTTTGGAACCGGGCTCTTTGGATCTAAACCTGCCACTGGGTTCACTCTA 
GGAGGAACAAATACAGGTGCCTTGCACACCAAGAGGCCTCAAGTGGTCACCAAATATGGAAC 
CCTGCAAGGAAAACAGATGCATGTG(3GGAAGACACCCATCCAAGTCTTTTTAGGAGTCCCCT 
TCTCCAGACCTCCTCTAGGTATCCTCAGGTTTGCACCTCCAGAACCCCCGGAGCCCTGGAAA 
GGAATCAGAGATGCTACCACCTACCCGCCTGGATGGAGTCTCGCTCTGTCGCCAGGCTGGAG 
TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGAGTCTCCTGC 
CTCAGCCTCTGAGTGTCTGGGGCTACilAGGTGCCTGCAGGAGTCCTGGGGCCAGCTGGCCTCG 
ATGTACGTCAGCACGCGGGAACGGTACAAGTGGCTGCGCTTCAGCGAG6ACTGTCTGTACCT 
GAACGTGTACGCGCCGGCGCGCGCGCCCGGGGATCCCCAGCTGCCAGTGATGGTCTGGTTCC 
CGGGAGGCGCCTTCATCGTGGGCGCTGCTTCTTCGTACGAGGGCTCTGACTTGGCCGCCCGC 
GAGAAAGTGGTGCTGGTGTTTCTGCAGCACAGGCTCGGCATCTTCGGCTTCCTGAGCACGGA 
CGACAGCCACGCGCGCGGGAACTGGGGGCTGCTGGACCAGATGGCGGCTCTGCGCTGGGTGC 
AGGAGAACATCGCAGCCTTCGGGGGAGACCCAGGAAATGTGACCCTGTTCGGCCAGTCGGCG 
GGGGCCATGAGCATCTCAGGACTGATGATGTCACCCCTAGCCTCGGGTCTCTTCCATCGGGC 
CATTTCCCAGAGTGGCACCGCGTTATTCAGACTTTTCATCACTAGTAACCCACTGAAAGTGG 
CCAAGAAGGTTGCCCACCT6GCTGGATGCAACCACAACAGCACACAGATCCTGGTAAACTGC 
CTGAGGGCACTATCAGGGACCAAGGTGATGCGTGTGTCCAACAAGATGAGATTCCTCCAACT 
GAACTTCCAGAGAGACCCGGAAGAGATTATCTGGTCCATGAGCCCTGTGGTGGATGGTGTGG 
TGATCCCAGATGACCCTTTGGTGCTCJCTGACCCAGGGGAAGGTTTCATCTGTGCCCTACCTT 
CTAGGTGTCAACAACCTGGAATTCAJi-TTGGCTCTTGCCTTATAATATCACCAAGGAGCAGGT 
ACCACTTGTGGTGGAGGAGTACCTGGACAATGTCAATGAGCATGACTGGAAGATGCTACGAA 
ACCGTATGATGGACATAGTTCAAGATGCCACTTTCGTGTATGCCACACTGCAGACTGCTCAC 
TACCACCGAGAAACCCCAATGATGGGfAATCTGCCCTGCTGGCCACGCTACAACAAGGATGAA 
AAGTACCTGCAGCTGGATTTTACCACAAGAGTGGGCATGAAGCTCAAGGAGAAGAAGATGGC 
TTTTTGGATGAGTCTGTACCAGTCTCAAAGACCTGAGAAGCAGAGGCAATTCTAAGGGTGGC 
TATGCAGGAAGGAGCCAAAGAGGGGTTTGCCCCCACCATCCAGGCCCTGGGGAGACTAGCCA 
TGGACATACCTGGGGACAAGAGTTCTACCCACCCCAGTTTAGAACTGCAGGAGCTCCCTGCT 
GCCTCCAGGCCAAAGCTAGAGCTTTT'GCCTGTTGTGTGGGACCTGCACTGCCCTTTCCAGCC 
TGACATCCCATGATGCCCCTCTACTT'CACTGTTGACATCCAGTTAGGCCAGGCCCTGTCAAC 
ACCACACTGTGCTCAGCTCTCCAGCCTCAGGACAACCTCTTTTTTTCCCTTCTTCAAATCCT 
CCCACCCTTCAATGTCTCCTTGTGACTCCTTCTTATGGGAGGTCGACCCAGACTGCCACTGC 
CCCTGTCACTGCACCCAGCTTGGCATrTACCATCCATCCTGCTCAACCTTGTTCCTGTCTGT 
TCACATTGGCCTGGAGGCCTAGGGCAGGTTGTGACATGGAGCAAACTTTTGGTAGTTTGGGA 
TCTTCTCTCCCACCCACACTTATCTCCCCCAGGGCCACTCCAAAGTCTATACACAGGGGTGG 
TCTCTTCAATAAAGAAGTGTTGATTAGAAAAAAAAAAA 



FIGURE 91 



< / us r / s egdb2 / s s t /DNA/Dna seqs.min/ss. DNA4 4179 
<s-abunlt 1 of 1, 545 aa, 1 stop 
<MW: 58934, pi: 9.45, NX(S/T): 4 

MSTGFSFGSGTLGSTTVAAGGTSTGfGVFSFGTGTSSNPSVGLNFGNLGSTSTPATTSAPSSG 
FGTGLFGSKPATGFTLGGTNTGALHTKRPQWTKYGTLQGKQMHVGKTPIQVFLGVPFSRPP 
LGILRFAPPEPPEPWKGIRDATTYPPGWSLALSPGWSAVARSRLTATSASRVQASLLPQPLS 
WGYRCLQESWGQLASMYVSTRERYKWLRFSEDCLYLIWYAPARAPGDPQLPVMVWFPGGAF 
IVGAASSYEGSDLAAREKWLVFLQHRLGIFGFLSTDDSHARGNWGLLDQMAALRWVQENIA 
AFGGDPGNVTLFGQSAGAMSISGLMMSPLASGLFHRAISQSGTALFRLFITSNPLKVAKKVA 
HLAGCNHNSTQI LVNCLRALSGTKVl^^RVSNKMRFLQLNFQRDPEE 1 1 WSMS PWDGWI PDD 
PLVLLTQGKYSSVPYLLGVmLEFlSnWLLPYNITKEQVPLWEEYLDNVNEHDWKMLRNRMiyro 
IVQDATFVYATLQTAHYHRETPMMGIICPAGHATTRMKSTCSWILPQEWA 

Important features: 
Signal peptide: 

amino acids 1-29 

Carboxylesterases type-B serine active site. 

amino acids 312-327 

Carboxylesterases type-B signature 2. 

amino acids 218-228 

N-glycosylation sites. 

amino acids 318-321, 380-383 and 465-468 



mm 



FIGURE 92 

GAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGATGCTACTGCCACTGCT 
GCTGTCCTCGCTGCTGGGCGGGTCC:CAGGCTATGGATGGGAGATTCTGGATACGAGTGCAGG 
AGTCAGTGATGGTGCCGGAGGGCCirGTGCATCTCTGTGCCCTGCTCTTTCTCCTACCCCCGA 
CAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTCAAAGCAGTGACTGAGACAAC 
CAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAGGTGGAAATGAGCACCCGGGGCC 
GATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGCTCCTTGGTGATCAGAGACGCGCAG 
ATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAGAGAGGAAGCTATGTGACATATAATTT 
CATGAACGATGGGTTCTTTCTAAAAGTAACAGTGCTCAGCTTCACGCCCAGACCCCAGGACC 
ACAACACCGACCTCACCTGCCATGTGGACTTCTCCAGAAAGGGTGTGAGCGCACAGAGGACC 
GTCCGACTCCGTGTGGCCTATGCCCCCAGAGACCTTGTTATCAGCATTTCACGTGACAACAC 
GCCAGCCCTGGAGCCCCAGCCCCAG!GGAAATGTCCCATACCTGGAAGCCCAAAAAGGCCAGT 
TCCTGCGGCTCCTCTGTGCTGCTGACAGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAG 
AACAGAGTCCTCTCCTCGTCCCATCCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGG 

ggtgaaggctggggattcagggcgc:tacacctgccgagcggagaacaggcttggctcccagc 
agcgagccctggacctctctgtgcagtatcctccagagaacctgagagtgatggtttcccaa 
gcaaacaggacagtcctggaaaaccttgggaacggcacgtctctcccagtactggagggcca 

AAGCCTGTGCCTGGTCTGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGA 

ggggacaggttctgagcccctcccj^.gccctcagaccccggggtcctggagctgcctcgggtt 
caagtggagcacgaaggagagttcacctgccacgctcggcacccactgggctcccagcacgt 
ctctctcagcctctccgtgcactataagaagggactcatctcaacggcattctccaacggag 

CGTTTCTGGGAATCGGCATCACGGCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAG 

attctaccgaagagacggactcagacagaaaccccgaggcccaggttctcccggcacagcac 

GATCCTGGATTACATCAATGTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAQA 

aagccacaccaaacagtcctcggacccctcctccaccaggtgctccctccccagaatcaaag 

AAGAACCAGAAAAAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGC 
CCCAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAi!..CTTCCCAGGCGTCA 
GACCCAGGCCTGAGGCCCGGATGCC'CAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTC 
CAATQAGGGTCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGGAGGAAGGTAGAGTAAGAG 
GTTGAAGATAACAGAGTGCAAA6TTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCT 
CTCTCTTTCTCTCTCTTTTAAAAAAI^CATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATC 
CCAGCACTTTGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCCTG 
GCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCAGG 
CGCCTGTAATCCTACCTACTTGGGAAGCTGAGGCAGGAGAATCACTTGAACCTGGGAGACGG 
AGGTTGCAGTGAGCCAAGATCACACCATTGCACGCCAGCCTGGGCAACAAAGCGAGACTCCA 
TCTCAAAAAAAAAATCCTCCAAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTA 
AGGTGGGTGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 
ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAGTCCCAGCTGT 
CAGACATTTAAACCAGAGCAACTCCilTCTGGAATAGGAGCTGAATAAAATGAGGCTGAGACC 
TACTGGGCTGCATTCTCAGACAGTGGAGGCATTCTAAGTCACAGGATGAGACAGGAGGTCCG 
TACAAGATACAGGTCATAAAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAA 
ATCCCACCAAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTCCT 
GACAGCACCATGACAGTTTACAAATC3CCATGGCAACATCAGGAAGTTACCCGATATGTCCCA 
AAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCAAATAAGCAAGAAATAACCATAAAA 
GTGGGCAACCAGCAGCTCTAGGCGCTGCTCTTGTCTATGGAGTAGCCATTCTTTTGTTCCTT 
TACTTTCTTAATAAACTTGCTTTCACCTTAAAAAAA 



FIGURE 93 

>< /us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 4 0 0 2 
xsxxbunit 1 of 1, 544 aa, 1 stop 
xMW: 60268, pi: 9.53, NX (S/T) : 3 

MLLPLLLSSLLGGSQAMDGRFWIRVQESVMVPEGLCISVPCSFSYPRQDWTGSTPAYGYWFK 
AVTETTKGAPVATNHQSREVEMSTRGRFQLTGDPAKGNCSLVIRDAQMQDESQYFFRVERGS 
YWYNFiynSTOGFFLKVTVLSFTPRPQDHNTDLTCHVDFSRKGVSAQRTVRLRVAYAPRDLVIS 
ISRDNTPALEPQPQGNVPYLEAQKGQFLRLLCAADSQPPATLSWVLQNRVLSSSHPWGPRPL 
GLELPGVKAGDSGRYTCRAENRLGSQQRALDLSVQYPPENLRVMVSQANRTVLENLGNGTSL 
PVLEGQSLCLVCVTHSSPPARLSWTQRGQVLSPSQPSDPGVLELPRVQVEHEGEFTCHARHP 
LGSQHVSLSLSVHYKKGLISTAFSNGAFLGIGITALLFLCLALIIMKILPKRRTQTETPRPR 
FSRHSTILDYINWPTAGPLAQKRNQKATPNSPRTPPPPGAPSPESKKNQKKQYQLPSFPEP 
KSSTQAPESQESQEELHYATLNFPG\^PRPEARMPKGTQADYAEVKFQ 

Important features: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 3 99-418 

N-glycosylation site. 

amino acids 100-103, 297-300 and 306-309 

Immunoglobulins and major histocompatibility complex proteins 
signature . 

amino acids 365-371 



TGAAGAGTAATAGTTGGAATCAAAAGAGTCAACGCAATGAACTGTTATTTACTGCTGCGTTT 
TATGTTGGGAATTCCTCTCCTATGGCCTTGTCTTGGAGCAACAGAAAACTCTCAAACAAAGA 
AAGTCAAGCAGCCAGTGCGATCTCATTTGAGAGTGAAGCGTGGCTGGGTGTGGAACCAATTT 
TTTGTACCAGAGGAAATGAATACGACTAGTCATCACATCGGCCAGCTAAGATCTGATTTAGA 
CAATGGAAACAATTCTTTCCAGTACJiAGCTTTTGGGAGCTGGAGCTGGAAGTACTTTTATCA 
TTGATGAAAGAACAGGTGACATATATGCCATACAGAAGCTTGATAGAGAGGAGCGATCCCTC 
TACATCTTAAGAGCCCAGGTAATAGACATCGCTACTGGAAGGGCTGTGGAACCTGAGTCTGA 
GTTTGTCATCAAAGTTTCGGATATCi!ATGACAATGAACCAAAATTCCTAGATGAACCTTATG 
AGGCCATTGTACCAGAGATGTCTCCAGAAGGAACATTAGTTATCCAGGTGACAGCAAGTGAT 
GCTGACGATCCCTCAAGTGGTAATAi^TGCTCGTCTCCTCTACAGCTTACTTCAAGGCCAGCC 
ATATTTTTCTGTTGAACCAACAACAGGAGTCATAAGAATATCTTCTAAAATGGATAGAGAAC 
TGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGTCAGCCAGGAGCGTTG 
TCTGGAACAACAAGTGTATTAATTAJ^CTTTCAGATGTTAATGACAATAAGCCTATATTTAA 
AGAAAGTTTATACCGCTTGACTGTCTCTGAATCTGCACCCACTGGGACTTCTATAGGAACAA 

tcatggcatatgataatgacataggagagaatgcagaaatggattacagcattgaagaggat 
gattcgcaaacatttgacattattac:taatcatgaaactcaagaaggaatagttatattaaa 
aaagaaagtggattttgagcaccagiuvccactacggtattagagcaaaagttaaaaaccatc 
atgttcctgagcagctcatgaagtac:cacactgaggcttccaccactttcattaagatccag 

GTGGAAGATGTTGATGAGCCTCCTCTTTTCCTCCTTCCATATTATGTATTTGAAGTTTTTGA 

agaaaccccacagggatcatttgtaggcgtggtgtctgccacagacccagacaataggaaat 

CTCCTATCAGGTATTCTATTACTAGCJAGCAAAGTGTTCAATATCAATGATAATGGTACAATC 
ACTACAAGTAACTCACTGGATCGTG^vAATCAGTGCTTGGTACAACCTAAGTATTACAGCCAC 
AGAAAAATACAATATAGAACAGATCTCTTCGATCCCACTGTATGTGCAAGTTCTTAACATCA 
ATGATCATGCTCCTGAGTTCTCTCA/iTACTATGAGACTTATGTTTGTGAAAATGCAGGCTCT 
GGTCAGGTAATTCAGACTATCAGTGCAGTGGATAGAGATGAATCCATAGAAGAGCACCATTT 
TTACTTTAATCTATCTGTAGAAGACA.CTAACAATTCAAGTTTTACAATCATAGATAATCAAG 
ATAACACAGCTGTCATTTTGACTAATAGAACTGGTTTTAACCTTCAAGAAGAACCTGTCTTC 
TACATCTCCATCTTAATTGCCGACA7tTGGAATCCCGTCACTTACAAGTACAAACACCCTTAC 
CATCCATGTCTGTGACTGTGGTGACAGTGGGAGCACACAGACCTGCCAGTACCAGGAGCTTG 

TTTGGGTTTATTT^"^"^^^^^^^^'^!^^^^^ 

gaaaagtgaagatttcagagagaatatattccaatatgatgatgaagggggtSagaagaag 

ATACAGAGGCCTTTGATATAGCAGAGCTGAGGAGTAGTACCATAATGCGGGAACGCAAGACT 

cggaaaaccacaagcgctgagatcaggagcctatacaggcagtctttgcaagttggccccga 

CAGTGCCATATTCAGGAAATTCATTCTGGAAAAGCTCGAAGAAGCTAATACTGATCCGTGTG 

cccctccttttgattccctccagacctacgcttttgagggaacagggtcattagctggatcc 

CTGAGCTCCTTAGAATCAGCAGTCTCTGATCAGGATGAAAGCTATGATTACCTTAATGAGTT 

gggacctcgctttaaaagattagcatgcatgtttggttctgcagtgcagtcaaataattagg 

GCTTTTTACCATCAAAATTTTTAAAAGTGCTAATGTGTATTCGAACCCAATGGTAGTCTTAA 

agagttttgtgccctggctctatggcggggaaagccctagtctatggagttttctgatttcc 

CTGGAGTAAATACTCCATGGTTATTTTAAGCTACCTACATGCTGTCATTGAACAGAGATGTG 
GGGAGAAATGTAAACAATCAGCTCAC?IGGCATCAATACAACCAGATTTGAAGTAAAATAATG 
TAGGAAGATATTAAAAGTAGATGAGAGGACACAAGATGTAGTCGATCCTTATGCGATTATAT 
CATTATTTACTTAGGAAAGAGTAAAAATACCAAACGAGAAAATTTAAAGGAGCAAAAATTTG 
CAAGTCAAATAGAAATGTACAAATCGAGATAACATTTACATTTCTATCATATTGACATGAAA 
ATTGAAAATGTATAGTCAGAGAAATTTTCATGAATTATTCCATGAAGTATTGTTTCCTTTAT 



FIGURE 95 



></usr/seqdb2/ sst/DNA/Dnaseqs . min/ss - DNA53 90 6 
xsTibunit 1 of 1, 772 aa, 1 stop 
xMW: 87002, pi: 4.64, ]SIX(S/T): 8 

MNCYLLLRFMLGIPLLWPCLGATENSQTKKVKQPWSHLRVKRGWVWNQFFVPEEMNTTSHH 
IGQLRSDLDNGNNSFQYKLLGAGAGSTFIIDERTGDIYAIQKLDREERSLYILRAQVIDIAT 
GRAVEPESEFVIKVSDINDHEPKFLDEPYEAIVPEMSPEGTLVIQVTASDADDPSSGNNARL 
LYSLLQGQPYFSVEPTTGVIRISSKMDRELQDEYWVIIQAKDMIGQPGALSGTTSVLIKIiSD 
VNDNKPIFKESLYllLTVSESAPTGTSIGTIMAYDNDIGENAEMDYSIEEDDSQTFDIITNHE 
TQEGIVILKKKVBFEHQNHYGIRAKVKNHHVPEQLMKYHTEASTTFIKIQVEDVDEPPLFLL 
PYYVFEVFEETPQGSFVGWSATDPDNRKSPIRYSITRSKVFNINDNGTITTSNSLDREISA 
WYNLSITATEKYNIEQI SS I PLYVQVLNINDHAPEFSQYYETYVCENAGSGQVIQTISAVDR 
DESIEEHHFYFNLSVEDTNNSSFTIIDNQDNTAVILTNRTGFNLQEEPVFYISILIADNGIP 
SLTSTNTLTIHVCDCGDSGSTQTCQYQELVLSMGFKTEVIIAILICIMIIFGFIFLTLGLKQ 
RRKQILFPEKSEDFRENIFQYDDEGGGEEDTEAFDIAELRSSTIMRERKTRKTTSAEIRSLY 
RQSLQVGPDSAIFRKFILEKLEEANTDPCAPPFDSLQTYAFEGTGSLAGSLSSLESAVSDQD 
ESYDYLNELGPRFKRLACMFGSAVQSNN 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 597-617 

N-glycosylation sites. 

amino acids 57-60, 74-77, 419-423, 437-440, 508-511, 515-518, 
516-519 and 534-537 

Cadherins extracellular repeated domain signature. 

amino acids 136-146 and 244-254 



FIGURE 96 



ATTTCAAGGCCAGCCATATTTTTNTGTTGAACCAACAACAGGAGTCATAAGAATATTTTNTA 
AAATGGATAGAGAACTGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGT 
CAGCCAGGAGCGTTGTNTGGAACAJiCAAGTGTATTAATTAAACTTTCAGATGTTAATGACAA 
TAAGCCTATATTTAAAGAAAGTTTATACCGCTTGACTGTNTNTGAATCTGCACCCACTGGGA 
NTTNTATAGGAACAATCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTAC 
AGCATTGAAGAGGATGATTCGCAAA.CATTTGACATTATT 



GCAACCTCAGCTTCTAGTATCCAGACTCCAGCGCCGCCCCGGGCGCGGACCCCAACCCCGAC 
CCAGAGCTTCTCCAGCGGCGGCGCAGCGAGCAGGGCTCCCCGCCTTAACTTCCTCCGCGGGG 
CCCAGCCACCTTCGGGAGTCCGGGTTGCCCACCTGCAAACTCTCCGCCTTCTGCACCTGCCA 
CCCCTGAGCCAGCGCGGGCCCCCGAGCGAGTCATGGCCAACGCGGGGCTGCAGCTGTTGGGC 
TTCATTCTCGCCTTCCTGGGATGGATCGGCGCCATCGTCAGCACTGCCCTGCCCCAGTGGAG 
GATTTACTCCTATGCCGGCGACAACATCGTGACCGCCCAGGCCATGTACGAGGGGCTGTGGA 
TGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAAT 
CTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGAT 
AGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGC 
AGAAGATGAGGATGGCTGTCATTGGGGGTGCGATATTTCTTCTTGCAGGTCTGGCTATTTTA 
GTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCCCAGT 
CAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCC 
TTCTG6GAGGTGCCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCAACACCA 
AGGCCCTATCCAAAACCTGCACCTTCCAGCGGGAAAGACTACGTGTGACACAGAGGCAAAAG 
GAGAAAATCATGTTGAAACAAACCGAAAATGGACATTGAGATACTATCATTAACATTAGGAC 
CTTAGAATTTTGGGTATTGTAATCTGAAGTATGGTATTACAAAACAAACAAACAAACAAAAA 
ACCCATGTGTTAAAATACTCAGTGCTAAACATGGCTTAATCTTATTTTATCTTCTTTCCTCA 
ATATAGGAGGGAAGATTTTTCCATTTGTATTACTGCTTCCCATTGAGTAATCATACTCAAAT 
GGGGGAAGGGGTGCTCCTTAAATATATATAGATATGTATATATACATGTTTTTCTATTAAAA 
ATAGACAGTAAAATACTATTCTCATTATGTTGATACTAGCATACTTAAAATATCTCTAAAAT 
AGGTAAATGTATTTAATTCCATATTGATGAAGATGTTTATTGGTATATTTTCTTTTTCGTCC 
TTATATACATATGTAACAGTCAAATATCATTTACTCTTCTTCATTAGCTTTGGGTGCCTTTG 
CCACAAGACCTAGCCTAATTTACCAi\GGATGAATTCTTTCAATTCTTCATGCGTGCCCTTTT 
CATATACTTATTTTATTTTTTACCATAATCTTATAGCACTTGCATCGTTATTAAGCCCTTAT 
TTGTTTTGTGTTTCATTGGTCTCTATCTCCTGAATCTAACACATTTCATAGCCTACATTTTA 
GTTTCTAAAGCCAAGAAGAATTTATTACAAATCAGAACTTTGGAGGCAAATCTTTCTGCATG 
ACCAAAGTGATAAATTCCTGTTGACCTTCCCACACAATCCCTGTACTCTGACCCATAGCACT 
CTTGTTTGCTTTGAAAATATTTGTCCAATTGAGTAGCTGCATGCTGTTCCCCCAGGTGTTGT 
AACACAACTTTATTGATTGAATTTTTAAGCTACTTATTCATAGTTTTATATCCCCCTAAACT 
ACCTTTTTGTTCCCCATTCCTTAATTGTATTGTTTTCCCAAGTGTAATTATCATGCGTTTTA 
TATCTTCCTAATAAGGTGTGGTCTGTTTGTCTGAACAAAGTGCTAGACTTTCTGGAGTGATA 
ATCTGGTGACAAATATTCTCTCTGTAGCTGTAAGCAAGTCACTTAATCTTTCTACCTCTTTT 
TTCTATCTGCCAAATTGAGATAATGATACTTAACCAGTTAGAAGAGGTAGTGTGAATATTAA 
TTAGTTTATATTACTCTTATTCTTTGAACATGAACTATGCCTATGTAGTGTCTTTATTTGCT 
CAGCTGGCTGAGACACTGAAGAAGTCACTGAACAAAACCTACACACGTACCTTCATGTGATT 
CACTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACACACATACCTTCAT 
GTGGTTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACGCACATAC 
CTTCATGTGGCTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCATTCTTTCAGCTGTGTCT 
GACATGTTTGTGCTCTGTTCCATTTTAACAACTGCTCTTACTTTTCCAGTCTGTACAGAATG 
CTATTTCACTTGAGCAAGATGATGTj?^TGGAAAGGGTGTTGGCACTGGTGTCTGGAGACCTG 
GATTTGAGTCTTGGTGCTATCAATCACCGTCTGTGTTTGAGCAAGGCATTTGGCTGCTGTAA 
GCTTATTGCTTCATCTGTAAGCGGTGGTTTGTAATTCCTGATCTTCCCACCTCACAGTGATG 
TTGTGGGGATCCAGTGAGATAGAATACATGTAAGTGTGGTTTTGTAATTTAAAAAGTGCTAT 
ACTAAGGGAAAGAATTGAGGAATTAi\CTGCATACGTTTTGGTGTTGCTTTTCAAATGTTTGA 
AAATAAAAAAAATGTTAAG 



FIGURE 98 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA52185 
xsiibunit 1 of 1, 211 aa, 1 stop 
xMW: 22744, pi: 8.51, NX{S/T): 1 

MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTGQI 
QCKVFDSLLNLSSTLQATRALMWGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAVIGGA 
IFLLAGLAILVATAWYGNRIVQEFYIDPMTPVNARYEFGQALFTGWAAASLCLLGGALLCCSC 
PRKTTSYPTPRPYPKPAPSSGKDYV 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-102, 118-142 and 161-187 

N-glycosylation site. 

amino acids 72-75 

PMP-22 / EMP / MP20 family proteins 

amino acids 70-111 

ABC-2 type transport systesm integral membrane protein 

amino acids 119-133 



TTCTGGCCAAACCCGGGGCTNCAGCTGTTGGGCTTCATCTCGCCTTCCTGGGATGGATCGGC 
GCCATCNTCACACTGCCCTTCCCCAGTGGAGGATTTTACTCCCTATGCTGGCGACAACATCG 
TGACCGCCCAGCCCATGTACGAGGGC5CTGTGGATGTCCNGCGTGTCGCAGAGCACCGGGCAG 
ATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCCGTGC 
CTTGATGGTGGTTGGCATCCTCCTGC3GAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGA 
AGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGC 
GCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAAN 
CNTTCAACANTTCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCA 
GGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCT 
GTTCCTGTCCC 



FIGURE 100 



ACCCTTGACCCAACGCGGCCCCCCG5ACCGNTTCATGGCCAAACGCGGGNCTCCAGCTGTTGG 
GCTTCATTCTCCCCTTCCTGGGAT<3GACCGGCGCCCATCNTCAGCACTGCCCTGCCCCAGTG 
GAGGATTTACTCCTATNCCGGCNACT^CATCGTGACCGCCCAGGCCNTGTACGAGGGGCTGT 
GGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCCTTGCT 
GAATCTGAGCAGCACATTGCAAGC^iACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAG 
TGATAGCAATCTTNNTGGCCACCGTTGTNNNTGAAGTGTATGAAGTGCTTGGAAGACGATGA 
GGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTA 
TTTTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCGA 



FIGURE 101 



GGGCCCGACCATTATCCAACCGGGNTCACTGTTGGCTCATCTCCCTCCTGGATGAANCGCGC 
CATCNTCAGACTCCCTGCCCCATGGIAGATTTENCCTATGCTGGCGACAACATCNTGACCCCC 
AGCCATGTACGAGGGGCTTTGAACGTCNGCGTGTCGCAGANCACCGGGCAGATCCAGTGCAA 
AGTCTTTGACTCCTTGCTGAATCTGfNGCAGCACATTGCAGCAACCCNTGCCCTGATGGTGGT 
TGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGT 
GCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTT 
CTTGCAGGTCTGGCTATTTNNNGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAAT 
TCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGC 
TGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGA 



ATTCTCCCCTCCTGGATGGATCGCNCCACCGTCACATTGCCTTCCCCCANTGGAGGATTNAC 

TCCTATGCTGGCGACAACATCGTGACCCCCCAGGCCATTTACCGAGGGGCTTTGGATGTCNT 

GCNTGTCGCAGAGCACCGGGCAGATCCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAG 

CAGCACATTGCAAGCAACCCGTGCCTTGATGGGGTTGGCATCCTCCTGGGAGTGATAGCAAC 

CTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCCAGAAG 

ATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTGTTGCAGGTCTGGCTATTTTAGTNGC 

CACAGCATGGTATGGCAATAGANTMfNTTOINGMSINTCTATGACCCTATGACC 

CCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTG 

GGAGGTGCCCTACTTTGCTGTTCCTGTCCC 



FIGURE 103 



AGAGCACCGGCAGATCCCAGTNCAAAGTCTTTGACCCTTGCTGAATCTGAGCAGCACATTNC 
AAGCAACCCCTTGCCTTGAAGGTGGTTGNCATCCCCCCTGGGAGTGAATAGCAATCTTTGTG 
GCCACCGTTGGCATGAAGTNTATGA^GTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTNNCCACAGCAT 
GGTATGGCAATAGNATKNTTCGNGGNTTCTATGACCCTATGACCCCAGTCAATGCCAGGTAC 
GAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGC 
CCTACTTTGCTGTTCCTGTCCCCGAi\ 



FIGURE 104 



AGCAATGCCCTGCCCCCAGTGGAGGATTAATTCCTATGNTGGGGACAACATTGTGACNGCCC 
AGGCCATGTACGGGGGGCTGTGGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGC 
AAAGTNTTTGACTCCTTGCTGAATTTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGT 
GGTTGGCATCTTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTGGNAATGAAGTGTATGA 
AGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTT 
CTTNTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAATNGTTCAAGA 
ATTTTATGACCCTATGACCCCAGTCiiiATGCCAGGTACGAATTTGGTCAGGCTTTNTTCACTG 
GCTGGGCTGCTGCTTNTTTCTGCCTI'NTGGGAGGTGCCCTANTTTGCTGTTCCTGCGAACC 



FIGURE 105 

TCATAGGGGGGCGCGATATTTTTTCTTGCAGGTNTGGTTATTTTAGTTGCCACAGCATGGTA 
TGGCAATAGAATCGTTCAAGAATTNTATGACCCTATGACCCCAGTCAATGCCAGGTACGAAT 
TTGGTCAGGCTCTNTTCACTGGNTGGGCTGCTGCTTCTNTNNGCCTTNTGGGAGGTGCCCTA 
CTTTGCTGTTCCTG 



FIGURE 106 



TTCCTGGGATGGATCCGCCCCCATCNTCACATGCCCTGCCCCNTGGAGATTTACNCCTATGC 
TGGCGAACAACATCNTGACCGCCCAGGCCATGTACGAGGGGCTGTGGAATGTCCTGCGTGTC 
CCAGAGCACCGGGCAGATCCAGTGC.?IASlGTCTTTGACTCCTTGCTGAATCTGAGCAGCACAT 

tgcaagcaaccntgccttgatggtgigttggcatcctcctgggagtgatagcaatctttgtgg 
ccaccgttggcatgaaagtgtatgaagtgcttggaagacgatgaggtgcagaagatgaggat 
ggctgtcattgggggcgcgatatttcttcttgcaggtctggctattttagnngccacagcat 
ggtatggcaatcagacccnntcanaaactctatgaccctatgaccccagtcaatgccaggta 
cgaatttggtcaggctctcttcactggctgggctgctgcttctctctgccttctgggaggtg 
ccctactttgctgttcctgtccccgil^aaaacaacctcttacccacg 



FIGURE 107 



CGGGGCTGCAGCTGTTGGGCTTCATCTCGCTTCCTGGGATGGAATCGGCGCCATCGTCAGCA 
CTGCCCTGCCCCATGGAGGATTTACTCNTATGCTGGCGACAACATCGTGACCNCCCAGGCCA 
TGTACGAGGGGCTGTGGATGTCNGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCT 
TTGACTCCTTGCTGAATCTGAGCAGiCACATTGCAAGCAACCNTGCCTTGATGGTGGTTGGCA 
TCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTG 
GAAGACGATGAGGTGCAGAAGATGAC3GATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGC 
AGGTCTGGCTATTTNTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTAT 
GACCCTATGACCCCAGTCAATGCCAGJGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGC 
TGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGAA 



gcgtgccgtcagctcgccgggcaccgcggcctcgccctcgccctccgcccctgcgcctgcac 
cgcgtagaccgacccccccctccagtcgcgcccacccggtagaggacccccgcccgtgccccg 
accggtccccgcctttttgtaaaacttaaagcgggcgcagcattaacgcttcccgccccggt 
gacctctcaggggtctccccgccaaaggtgctccgccgctaaggaacatggcgaaggtggag 
caggtcctgagcctcgagccgcagcacgagctcaaattccgaggtcccttcaccgatgttgt 
caccaccaacctaaagcttggcaac:ccgacagaccgaaatgtgtgttttaaggtgaagacta 
cagcaccacgtaggtactgtgtgaggcccaacagcggaatcatcgatgcaggggcctcaatt 
aatgtatctgtgatgttacagcctttcgattatgatcccaatgagaaaagtaaacacaagtt 
tatggttcagtctatgtttgctccaactgacacttcagatatggaagcagtatggaaggagg 
caaaaccggaagaccttatggattc:aaaacttagatgtgtgtttgaattgccagcagagaat 
gataaaccacatgatgtagaaataaataaaattatatccacaactgcatcaaagacagaaac 
accaatagtgtctaagtctctgagttcttctttggatgacaccgaagttaagaaggttatgg 
aagaatgtaagaggctgcaaggtgaagttcagaggctacgggaggagaacaagcagttcaag 

GAAGAAGATGGACTGCGGATGAGG^AGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGC 
CCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTA 

tcgttggtgtaattattgggaagattgccttgtagaggtagcatgcacaggatggtaaattg 
gattggtggatccaccatatcatgggatttaaatttatcataaccatgtgtaaaaagaaatt 
aatgtatgatgacatctcacaggtcttgcctttaaattacccctccctgcacacacatacac 
agatacacacacacaaatataatgtaacgatcttttagaaagttaaaaatgtatagtaactg 
attgagggggaaaaagaatgatctttattaatgacaagggaaaccatgagtaatgccacaat 
ggcatattgtaaatgtc:attttaaft.cattggtaggccttggtacatgatgctggattacctc 
tcttaaaatgacacccttcctcgcctgttggtgctggcccttggggagctggagcccagcat 
gctggggagtgcggtcagctccaca'cagtagtccccacgtggcccactcccggcccaggctg 
ctttccgtgtcttcagttctgtccaagccatcagctccttgggactgatgaacagagtcaga 
agcccaaaggaattgcactgtggca.gcatcagacgtactcgtcataagtgagaggcgtgtgt 
tgactgattgacccagcgctttggaaataaatggcagtgctttgttcacttaaagggaccaa 
gctaaatttgtattggttcatgtagtgaagtcaaactgttattcagagatgtttaatgcata 
tttaacttatttaatgtatttcatctcatgttttcttattgtcacaagagtacagttaatgc 
tgcgtgctgctgaactctgttgggt'gaactggtattgctgctggagggctgtgggctcctct 
gtctctggagagtctggtcatgtggaggtggggtttattgggatgctggagaagagctgcca 

GGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTAGTG 

acagtcaactctaggttaccttttttaatgaagagtagtcagtcttctagattgttcttata 
ccacctctcaaccattactcacacttccagcgcccaggtccaagtctgagcctgacctcccc 
ttggggacctagcctggagtcaggaciaaatggatcgggctgcagagggttagaagcgagggc 
accagcagttgtgggtggggagcaagggaagaga6aaactcttcagcgaatccttctagtac 
tagttgagagtttgactgtgaattaattttatgccataaaagaccaacccagttctgtttga 
ctatgtagcatcttgaaaagaaaaattataataaagccccaaaattaagaaaa 



F3CGURE 109 

</usr/seqdb2/sst/DNA/Dnasegs -min/ss .DNA53977 
<siibunit 1 of 1, 243 aa, 1 stop 
<MW: 27228, pi: 7,43, NX(S/T): 2 

MAKVEQVLSLEPQHELKFRGPFTDVS/TTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIID 
AGASIKTVSVMLQPFDYDPNEKSKHKiriWQSMFAPTDTSDMEAWKEAKPEDLMDSKLRCVFE 
LPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREE 
NKQFKEEDGLRMRKTVQSNS P I S AL7iPTGKEEGLSTRLLALWLFFI VGVI IGKI AL 

Important features: 
Transmembrane domain: 

amino acids 224-239 

N-glycosylation site. 

amino acids 68-71 

N-myristoylation site. 

amino acids 59-64, 64-69 and 235-240 



FIGURE 11 



GTCAGTCTTCTAGATTGTCCTTATCCCACCTTTCAACCANTACTCACATTTCNAGCGCCCAG 
GTCCANGTCTGAGCCTGACTTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAATGGNTCGGG 
CTGCAGAGGNTTAGAAGCGAGGGCACCAGCAGTTTTGGGTGGGGAGCAAGGGNNGAGAGAAA 
CTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATA 
AAAGACNAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCC 
CCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTT 
ATCATTTTTATTATTTTGCCATTGGJyiGGTTAACTTTAAAATGAGC 



FIGURE 111 



TATTGTAAAGGCCATTTTAAACCATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCCTT 
AAATGACACCNTTCCTCGCCTGTTGGTGCTGGCCNTTGGGGAGCTGGAGCCCCAGCATGCTG 

gggagtgcggtcagctccacacagtagtccccacgtggcccactcccggcccaggctgcttt 
ccgtgtcttcagttctgtccaagcc:atcagctccttgggactgatgaacagagtcagaagcc 

CAAAGGAATTGCCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGA 

ctgattgacccagcgctttggaaataaatggcagtgctttgttcacttaaagggaccaagct 

AAATTGTATTGGTTCATGTAGTGAJSiGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTA 

acttatttaatgtatttcatctcatgttttcttattgtcacaagagtacagttaatgctgcg 
tgctgctgaactctgttgggtgaactggtattgctgctggagggctg 



FIGURE 112 



CCCTGGTGGTTTTGTTCTTTAATTCIGTTGGTGTAATTNTTGGGAAGATTGCTTGTAGAGGTA 
GNATGCACCNGGCTGGTAAATTGGATTGGTGGATCCACCATATCCATGGGATTTAAATTTAT 
CATAACCATGTGTAAAAAGAAATTAATGTATGATGACATNTCACAGGTATTGCCTTTAAATT 
ACCCATCCCTGNANACACATACACAGATACACANANACAAATNTAATGTAACGATNTTTTAG 
AAAGTTAAAAATGTATAGTAAC 



FIGURE 113 



GGTGGCCCATTCCCGGCCCAGGCTGCTTTCCGGTNTTCAGTTCTGTCCAAGCCATCAGCTCC 
TTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATNAGACGTAC 
TTGTNATAAGTGAGAGGCGTGTGTTCSACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGT 
GCTTTGTTCANTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTG 
TTATTCAGAGATGTTTAATGCATATTTAANTTATTTAATGTATTTNATNTCATGTTTTCTTA 
TTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAANTNTGTTGGGTGAACTGGTATTGC 
TGCTGGAGGGCTGTGGGCTCCTCTGTCTTTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 114 



TGCTTTCCGTGTCTTCAGTTCTGTC:CAAGCCATCAGCTCCTTGGGACTTGATGAACAGAGTC 
AGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTG 
TGTTGACTGATTGACCCAGCGCTTT'GGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGAC 
CAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGC 
ATATTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAA 
TGCTGCGTGC 



FIGURE 115 



AAACCTTTAAAAGTTGAGGGGAAAAGAATGATCCTTTATTAATGACAAGGGAAACCNTGNGT 
AATGCCACAATGGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGC 
TGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTN 
GAGCCCAGCATGCTGGGGAGTGCGGTCTGCTCCACACAGTAGTCCCCANGTGGCCCANTCCC 
GGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGANTGATGA 
ACAGAGTCAGAAGCCCAAAGGAATTGCANTGTGGCAGCATCAGANGTANTNGTCATAAGTGA 
GAGGCGTGTGTTGANTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCANTT 
AAAGGGNCCAAGNTAAATTTGTATTG6TTCATGTAGTGAAGTCAAANTGTTATTCAGAGATG 
TTTAATGCa.TATTTAANTTATTTAATGTATTTCATNTCATGTTTTCTTATTGTCACAAGGGT 
ACAGTTAATGCTGCGTGCTGCTGAAl^CTGTTGGGTGAANTGGTATTGCTG 



FIGURE 116 



GGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCC 
CACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGC 
TCCTTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACG 
TACTCGTCATAAGTGAGAGGCGTGI'GTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGC 
AGTGCTTTGTTCACTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAA 
CTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAATGTATTTCATCTCATGTTTTC 
TTATTGTCACAAGAGTACAGTTAAl'GCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTAT 
TGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 117 

GCGAGCTCCGGGTGCTGTGGCCCGGCCTTGGCGGGGCGGCCTCCGGCTCAGGCTGGCTGAGA 
GGCTCCCAGCTGCAGCGTCCCCGCCCGCCTCCTCGGGAGCTCTGATCTCAGCTGACAGTGCC 
CTCGGGGACCAAACAAGCCTGGCAGGGTCTCACTTTGTTGCCCAGGCTGGAGTTCAGTGCCA 
TGATCATGGTTTACTGCAGCCTTGACCTCCTGGGTTCAAGCGATCCTGCTGAGTAGCTGGGA 
CTACAGGACAAAATTAGAAGATCAA?^ATGGAAAATATGCTGCTTTGGTTGATATTTTTCACC 
CCTGGGTGGACCCTCATTGATGGATCTGAAATGGAATGGGATTTTATGTGGCACTTGAGAAA 
GGTACCCCGGATTGTCAGTGAAAGGACTTTCCATCTCACCAGCCCCGCATTTGAGGCAGATG 
CTAAGATGATGGTAAATACAGTGTGTGGCATCGAATGCCAGAAAGAACTCCCAACTCCCAGC 
CTTTCTGAATTGGAGGATTATCTTTCCTATGAGACTGTCTTTGAGAATGGCACCCGAACCTT 
AACCAGGGTGAAAGTTCAAGATTTGGTTCTTGAGCCGACTCAAAATATCACCACAAAGGGAG 
TATCTGTTAGGAGAAAGAGACAGGTGTATGGCACCGACAGCAGGTTCAGCATCTTGGACAAA 
AGGTTCTTAACCAATTTCCCTTTCA(3CACAGCTGTGAAGCTTTCCACGGGCTGTAGTGGCAT 
TCTCATTTCCCCTCAGCATGTTCTAACTGCTGCCCACTGTGTTCATGATGGAAAGGACTATG 
TCAAAGGGAGTAAAAAGCTAAGGGTAGGGTTGTTGAAGATGAGGAATAAAAGTGGAGGCAAG 
AAACGTCGAGGTTCTAAGAGGAGCAGGAGAGAAGCTAGTGGTGGTGACCAAAGAGAGGGTAC 
CAGAGAGCATCTGCAGGAGAGAGCG;?kAGGGTGGGAGAAGAAGAAAAAAATCTGGCCGGGGTC 
AGAGGATTGCCGAAGGGAGGCCTTCCTTTCAGTGGACCCGGGTCAAGAATACCCACATTCCG 
AAGGGCTGGGCACGAGGAGGCATGGGGGACGCTACCTTGGACTATGACTATGCTCTTCTGGA 
GCTGAAGCGTGCTCACAAAAAGAAATACATGGAACTTGGAATCAGCCCAACGATCAAGAAAA 
TGCCTGGTGGAATGATCCACTTCTCAGGATTTGATAACGATAGGGCTGATCAGTTGGTCTAT 
CGGTTTTGCAGTGTGTCCGACGAATCCAATGATCTCCTTTACCAATACTGCGATGCTGAGTC 
GGGCTCCACCGGTTCGGGGGTCTATCTGCGTCTGAAAGATCCAGACAAAAAGAATTGGAAGC 
GCAAAATCATTGCGGTCTACTCAGGGCACCAGTGGGTGGATGTCCACGGGGTTCAGAAGGAC 
TACAACGTTGCTGTTCGCATCACTCCCCTAAAATACGCCCAGATTTGCCTCTGGATTCACGG 
GAACGATGCCAATTGTGCTTACGGCTAACAGAGACCTGAAACAGGGCGGTGTATCATCTAAA 
TCACAGAGAAAACCAGCTCTGCTTACCGTAGTGAGATCACTTCATAGGTTATGCCTGGACTT 
GAACTCTGTCAATAGCATTTCAACATTTTTCAAAATCAGGAGATTTTCGTCCATTTAAAAAA 
TGTATAGGTGCAGATATTGAAACTAGGTGGGCACTTCAATGCCAAGTATATACTCTTCTTTA 
CATGGTGATGAGTTTCATTTGTAGAAAAATTTTGTTGCCTTCTTAAAAATTAGACACACTTT 
AAACCTTCAAACAGGTATTATAAATJaLACATGTGACTCCTTAATGGACTTATTCTCAGGGTCC 
TACTCTAAGAAGAATCTAATAGGATGCTGGTTGTGTATTAAATGTGAAATTGCATAGATAAA 
GGTAGATGGTAAAGCAATTAGTATCAGAATAGAGACAGAAAGTTACAACACAGTTTGTACTA 
CTCTGAGATGGATCCATTCAGCTCATGCCCTCAATGTTTATATTGTGTTATCTGTTGGGTCT 
GGGACATTTAGTTTAGTTTTTTTGAi\GAATTACAAATCAGAAGAAAAAGCAAGCATTATAAA 
CAAAACTAATAACTGTTTTACTGCTTTAAGAAATAACAATTACAATGTGTATTATTTAAAAA 
TGGGAGAAATAGTTTGTTCTATGAAATAAACCTAGTTTAGAAATAGGGAAGCTGAGACATTT 
TAAGATCTCAAGTTTTTATTTAACT.AATACTCAAAATATGGACTTTTCATGTATGCATAGGG 
AAGACACTTCACAAATTATGAATGATCATGTGTTGAAAGCCACATTATTTTATGCTATACAT 
TCTATGTATGAGGTGCTACATTTTTAGGACAAAGAATTCTGTAATCTTTTTCAAGAAAGAGT 
CTTTTTCTCCTTGACAAAATCCAGCTTTTGTATGAGGACTATAGGGTGAATTCTCTGATTAG 
TAATTTTAGATATGTCCTTTCCTAAJWITGAATAAAATTTATGAATATGA 
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< /us r / seqdb2 / s s t /DNA/Dnas eqs . min/ s s . DKTAS 7 2 5 3 
<subunit 1 of 1, 413 aa, 1 stop 
<MW: 47070, pi: 9.92, NX(I3/T): 3 

MENMLLWLIFFTPGWTLIDGSEMEWDFMWHLRKVPRIVSERTFHLTSPAFEADAKMMW 

GIECQKELPTPSLSELEDYLSYETVFENGTRTLTRVKYQDLVLEPTQNITTKGVSVRRKRQV 

YGTDSRFSILDKRFLTNFPFSTAVKLSTGCSGILISPQHVLTAAHCVHDGKDYVKGSKKLRV 

GLLKMRNKSGGKKRRGSKRSRREASGGDQREGTREHLQERAKGGRRRKKSGRGQRIAEGRPS 

FQWTRVKNTHIPKGWARGGMGDATLDYDYALLELKRAHKKKYMELGISPTIKKMPGGMIHFS 

GFDNDRADQLVYRFCSVSDESNDLLYQYCDAESGSTGSGVYLRLKDPDKKNWKRKIIAVYSG 

HQWVDYHGVQKDYNVAWITPLKYAQICLWIHGNDANCAYG 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 90-93, 110-113 and 193-196 

Glycosaminoglycan attachmsint site, 
amino acids 236-239 

Serine proteases, trypsin family, histidine active site. 

amino acids 165-170 
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AATGTGAGAGGGGCTGATGGAAGCTCJATAGGCAGGACTGGAGTGTTAGCACCAGTACTGGAT 
GTGACAGCAGGCAGAGGAGCACTTAGCAGCTTATTCAGTGTCCGATTCTGATTCCGGCAAGG 
ATCCAAGCATGGAATGCTGCCGTCGCXJCAACTCCTGGCACACTGCTCCTCTTTCTGGCTTTC 
CTGCTCCTGAGTTCCAGGACCGCACC5CTCCGAGGAGGACCGGGACGGCCTATGGGATGCCTG 
GGGCCCATGGAGTGAATGCTCACGCACCTGCGGGGGAGGGGCCTCCTACTCTCTGAGGCGCT 
GCCTGAGCAGCAAGAGCTGTGAAGGiivAGAAATATCCGATACAGAACATGCAGTAATGTGGAC 
TGCCCACCAGAAGCAGGTGATTTCCGAGCTCAGCAATGCTCAGCTCATAATGATGTCAAGCA 
CCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGACAACCCATGTTCACTCA 
AGTGCCAAGCCAAAGGAACAACCCTC5GTTGTTGAACTAGCACCTAAGGTCTTAGATGGTACG 
CGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTATGCCAAATTGTTGGCTGCGA 
TCACCAGCTGGGAAGCACCGTCAAGGJAAGATAACTGTGGGGTCTGCAACGGAGATGGGTCCA 
CCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTCTCCGCAACCAAATCGGATGATACT 
GTGGTTGCACTTCCCTATGGAAGTAGACATATTCGCCTTGTCTTAAAAGGTCCTGATCACTT 
ATATCTGGAAACCAAAACCCTCCAGC3GGACTAAAGGTGAAAACAGTCTCAGCTCCACAGGAA 
CTTTCCTTGTGGACAATTCTAGTGTGGACTTCCAGAAATTTCCAGACAAAGAGATACTGAGA 
ATGGCTGGACCACTCACAGCAGATTi:CATTGTCAAGATTCGTAACTCGGGCTCCGCTGACAG 
TACAGTCCAGTTCATCTTCTATCAACCCATCATCCACCGATGGAGGGAGACGGATTTCTTTC 
CTTGCTCAGCAACCTGTGGAGGAGGTTATCAGCTGACATCGGCTGAGTGCTACGATCTGAGG 
AGCAACCGTGTGGTTGCTGACCAATACTGTCACTATTACCCAGAGAACATCAAACCCAAACC 
CAAGCTTCAGGAGTGCAACTTGGATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGC 
CTTATGACCTCTACCATCCCCTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCC 
TCGTGTGGGGGGGGCATCCAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCA 
TGTCACTTCAGTGGAAGAGTGGAAATGCATGTACACCCCTAAGATGCCCATCGCGCAGCCCT 
GCAACATTTTTGACTGCCCTAAATGC3CTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGT 
GGCCAGGGCCTCAGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGG 
CTGTAGCCCAAAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATA 
AACCCAAAGAGAAACTTCCAGTCGACJGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTA 
GAAGAAGGAGCTGCTGTGTCAGAGGAGCCCTCGTAAGTTGTAAAAGCACAGACTGTTCTATA 
TTTGAAACTGTTTTGTTTAAAGAAAGCAGTGTCTCACTGGTTGTAGCTTTCATGGGTTCTGA 
ACTAAGTGTAATCATCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 
AAAAAAAAA 
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< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 8847 
<siibunit 1 of 1, 525 aa, 1 stop 
<MW: 58416, pi: 6.62, NX{S/T): 1 

MECCRRATPGTLLLFLAFLLLSSRTi^SEEDRDGLWDAWGPWSECSRTCGGGASYSLRRCLS 
SKSCEGRNIRYRTCSNVDCPPFAGDFRAQQCSAHNDVKHHGQFYEWLPVSNDPDNPCSLKCQ 
AKGTTLWELAPKVLDGTRCYTESLDMCISGLCQIVGCDHQLGSTVKEDNCGVCNGDGSTCR 
LVRGQYKSQLSATKSDDTWALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENSLSSTGTFL 
VDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHRWRETDFFPCS 
ATCGGGYQLTSAECYDLRSNRWADQYCHYYPENIKPKPICLQECNLDPCPASDGYKQIMPYD 
LYHPLPRWEATPWTACSSSCGGGIQS5RAVSCVEEDIQGHVTSVEEWKCMYTPKMPIAQPCNI 
FDCPKWLAQEWSPCTVTCGQGLRYRWLCIDHRGMHTGGCSPKTKPHIKEECIVPTPCYKPK 
EKLPVEAKLPWFKQAQELEEGAAVSEEPS 

Important features : 
Sigmal peptide: 

amino acids 1-25 

N-glycosylation site. 

amino acids 251-254 

Thrombospondin 1 

amino acids 385-399 

von Willebrand factor type C domain proteins 

amino acids 385-399, 445-459 and 42-56 
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CGGACGCGTGGGCGGCGGCTGCGGASlCTCCCGTGGAGGGGCCGGTGGGCCCTCGGGCCTGAC 
AGATGGCAGTGGCCACTGCGGCGGCAGTACTGGCCGCTCTGGGCGGGGCGCTGTGGCTGGCG 
GCCCGCCGGTTCGTGGGGCCCAGGGTCCAGCGGCTGCGCAGAGGCGGGGACCCCGGCCTCAT 
GCACGGGAAGACTGTGCTGATCACCGGGGCGAACAGCGGCCTGGGCCGCGCCACGGCCGCCG 
AGCTACTGCGCCTGGGAGCGCGGGTGATCATGGGCTGCCGGGACCGCGCGCGCGCCGAGGAG 
GCGGCGGGTCAGCTCCGCCGCGAGCTCCGCCAGGCCGCGGAGTGCGGCCCAGAGCCTGGCGT 
CAGCGGGGTGGGCGAGCTCATAGTCCGGGAGCTGGACCTCGCCTCGCTGCGCTCGGTGCGCG 
CCTTCTGCCAGGAAATGCTCCAGGAAGAGCCTAGGCTGGATGTCTTGATCAATAACGCAGGG 
ATCTTCCAGTGCCCTTACATGAAGACTGAAGATGGGTTTGAGATGCAGTTCGGAGTGAACCA 
TCTGGGGCACTTTCTACTCACCAATCTTCTCCTTGGACTCCTCAAAAGTTCAGCTCCCAGCA 
GGATTGTGGTAGTTTCTTCCAAACTTTATAAATACGGAGACATCAATTTTGATGACTTGAAC 
AGTGAACAAAGCTATAATAAAAGCTTTTGTTATAGCCGGAGCAAACTGGCTAACATTCTTTT 
TACCAGGGAACTAGCCCGCCGCTTAGAAGGCACAAATGTCACCGTCAATGTGTTGCATCCTG 
GTATTGTACGGACAAATCTGGGGAGGCACATACACATTCCACTGTTGGTCAAACCACTCTTC 
AATTTGGTGTCATGGGCTTTTTTCAAAACTCCAGTAGAAGGTGCCCAGACTTCCATTTATTT 
GGCCTCTTCACCTGAGGTAGAAGGAGTGTCAGGAAGATACTTTGGGGATTGTAAAGAGGAAG 
AACTGTTGCCCAAAGCTATGGATGAATCTGTTGCAAGAAAACTCTGGGATATCAGTGAAGTG 
ATGGTTGGCCTGCTAAAATAGGAACi^GGAGTAAAAGAGCTGTTTATAAAACTGCATATCAG 
TTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACTTGAAGAAAAAGAATTTTG 
ATATTGGAATAGCCTGCTAAGAGGTACATGTGGGTATTTTGGAGTTACTGAAAAATTATTTT 
TGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGTAAGTATAATGAATAATAA 
GTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCAAGCATGGATGACATATTA 
ATATTTGTCAGAATTAAGTGACTCA?^GTGCTATCGAGAGGTTTTTCAAGTATCTTTGAGTT 
TCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTTTGTGTGGAAATTATCTGC 
CTGGTGTGTGCACACAAGTCTTACTTGGAATAAATTTACTGGTAC 



FIGURE 122 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss-DNA58747 
<subunit 1 of 1, 336 aa, 1 stop 
<MW: 36865, pi: 9.15, NX(S/T): 2 

MAVATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLmGKTVLITGANSGLGRATAAE 
LLRLGARVIMGCRDRARAEEAAGQLFLRELRQAAECGPEPGVSGVGELIVRELDLASLRSVRA 
FCQEMLQEEPRLDVL INNAG I FQCP YMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKS SAPSR 
IVWSSKLYKYGDINFDDLNSEQSYNKSFCYSRSKIJmiLFTREIJmRLEGTNVTVNVLHPG 
IVRTNLGRHIHIPLLVKPLFNLVSWAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEEE 
LL PKAMDES VARKLWD I SE VMVGLLK 

Important features: 
Signal peptide: 

amino acids 1-21 

Short-chain alcohol dehydrogenase family protein 

amino acids 134-144, 44-56 and 239-248 

N-glycosylation site. 

amino acids 212-215 and 239-242 
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GGGGATTGTAAAGAGGAAGNACTGTGCCCAAAGNTATGGATGAATCTGTTGCAAGAAAATTN 
TGGGATATCAGTGAAGTGATGGTTKrGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTT 
TATAAAACTGCATATCAGTTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACT 
TGAAGAAAAAGAATTTTGATATTGGAATAGCCTGNTAAGAGGNACATGTGGGTATTTTGGAG 
TTACTGAAAAATTATTTTTGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGT 
AAGTATAATGAATAATAAGTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCA 
AGCATGGATGACATATTl^TATTTGiTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTT 
TTCAAGTATCTTTGAGTTTCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTT 
TGTGTGGAAATTATCTGCCTGGCTT 



GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCC 
CTTTCCTAACCCAACCCAACCTAGCC:CAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGAC 
CCCAGCGTTACCATGCATCCTGCCGl'CTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCT 
GCTCCTGGTAACTTGGGTTTTTACTC:CTGTAACAACTGAAATAACAAGTCTTGCTACAGAGA 
ATATAGATGAAATTTTAAACAA.TGC']?GATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGT 
CGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATT 
TCCAAATGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCC 
AGAGATACAGGATAAGCAAATACCCi^ACCCTCAAATTGTTTCGTAATGGGATGATGATGAAG 
AGAGAATACAGGGGTCAGCGATCAGI'GAAAGCATTGGCAGATTACATCAGGCAACAAAAAAG 
TGACCCCATTCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAA 
ATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCG 
AATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAG 
ATATAGTGGCGACAACATAATCTACAAACCACCAGGGCATTCTGCTCCGGATATGGTGTACT 
TGGGAGCTATGACAAATTTTGATGTGtACTTACAATTGGATTCAAGATAAATGTGTTCCTCTT 
GTCCGAGAAATAACATTTGAAAATGGtAGAGGAATTGACAGAAGAAGGACTGCCTTTTCTCAT 
ACTCTTTCACATGAAAGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGC 
AATTAATAAGTGAAAAAGGTACAATiiiAACTTTTTACATGCCGATTGTGACAAATTTAGACAT 
CCTCTTCTGCACATACAGAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAG 
GCATATGTATGTGTTTGGAGACTTCaiAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCG 
TATTTGACTTACATTCTGGAAAACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGAT 
ACAGCCCCAGGAGAGCAAGCCCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAA 
ACTAGCACCCAGTGAATATAGGTATACTCTATTGAGGGATCGAGATGAGCT TTAAA AACTTG 
AAAAACAGTTTGTAAGCCTTTCAACAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTA 
TATTTTCATAATTCTATGTGTATTTTTATTTTGAATAAACAGAAAGAAATTTAAAAAAAAAA 
•AAAAAAAAAAAAAAAAAAAAAAAAAA?^AAAAAAAA 
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</usr/seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA57689 
<subunit 1 of 1, 406 aa, 1 stop 
<MW: 46927, pi: 5.21, NX(£;/T) : 0 

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFYADWCRFSQ 
MLHPIFEEASDVIKEEFPNENQWFi\RVDCDQHSDIAQRYRISKYPTLKLFRNGMMMKREYR 
GQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFERVANILH 
DDCAFLSAFGDVSKPERYSGDNI I YKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVRE I 
TFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLH 
IQKTPADCPVIAIDSFRHMYVFGDFliCDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPG 
EQAQDVASSPPES S FQKLAPSEYRYTLLRDRDEL 

Important features: 
Signal peptide: 

amino acids 1-29 

Endoplasmic reticuloim targeting secpience. 

amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 

Thioredoxin family proteins 

amino acids 50-66 
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ATTAAGGAAGAATTTCCAAATGAA^VATCAAGTAGTNTTTGCCAGAGTNGATTGTGATCAGCA 
CTCTGACATAGCCCAGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATG 
GGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA 



FIGURE 127 



AGAGGCCTCTCTGGAAGTTGTCCCGGGTGTTCGCCGCNGGAGCCCGGGTCGAGAGGACNAGG 
TGCCGCTGCCTGGAGAATCCTCCGCITGCCGTCGGCTCCCGGAGCCCAGCCCTTTCCTAACCC 
AACCCAACCTAGCCCNGTCCCAGCCGCCAGCGCCTGTCCCTGTCNCGGANCCCAGCGTNACC 
ATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCTCCTGGTAAC 
TTGGGTTTTTACTCCTGTAACAACTGAAATAACNNGTCTTGATACNNAGAATATAGATGAAA 
TTTTAAACNATGCTGATGTGGCTTTAGTCAATTTTTATGCTGACTGGTGTCGTTTCAGTCAG 
ATGTGGCATCCAATTTTTGAGGANGCTTCCGATGTCATTAAGGAAGAATTTCCAAATGAAAA 
TCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCCAGAGATACAGGA 
TAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAGAGAGAATACAGG 
GGTCAGCGATCAGTGAAAGCATTGGC^AGATTACATCAGGC 



FIGURE 128 



GCCCACGCGTCCGATGGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCTCA 
CTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGACTGAT 
TACAAGAATCCTATAGACCAGTGTA?^TACCCTGAATCCCCTTGTACTCCCAGAGTACCTCAT 
CCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGGTCTCAATA 
TGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAGTGGCCCAGGA 
CTCTATGACCCTACAACCATCATGAi^TGCAGATATTCTAGCATATTGTCAGAAGGAAGGATG 
GTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGGCATGATCTATGTTT 
TGGTGAGCTCTTAGAACAACACACAGAAGAATTGGTCCAGTTAAGTGCATGCAAAAAGCCAC 
CAAATGAAGGGATTCTATCCAGCAAGATCCTGTCCAAGAGTAGCCTGTGGAATCTGATCAGT 
TACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACATTTTTGCTTGTGGAAAGACTG 
TTTTCATATGTTATACTCAGATAAAC5ATTTTAAATGGTATTACGTATAAATTAATATAAAAT 
GATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTCTTAAGGAACAGCCATAATCCTCTG 
AATGATGCATTAATTACTGACTGTCCTAGTACATTGGAAGCTTTTGTTTATAGGAACTTGTA 
GGGCTCATTTTGGTTTCATTGAfiACilGTATCTAATTATAAATTAGCTGTAGATATCAGGTGC 
TTCTGATGAAGTGAAAATGTATATCTGACTAGTGGGAAACTTCATGGGTTTCCTCATCTGTC 
ATGTCGATGATTATATATGGATACAa?TTACAAAAATAAAAAGCGGGAATTTTCCCTTCGCTT 
GAATATTATCCCTGTATATTGCATGj^ATGAGAGATTTCCCATATTTCCATCAGAGTAATAAA 
TATACTTGCTTTAATTCTTAAGCATi^GTAAACATGATATAAAAATATATGCTGAATTACTT 
GTGAAGAATGCATTTAAAGCTATTTTAAATGTGTTTTTATTTGTAAGACATTACTTATTAAG 
AAATTGGTTATTATGCTTACTGTTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGG 
TACTACAGATTTTCAAAACTGAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGT 
GCAATACAATAAAACTCTGAAATTAJi.GACTC 



FIGURE 129 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA23330 
<subunit 1 of 1, 144 aa, 1 stop 
<MW: 16699, pi: 5.60, NX(S/T): 0 

MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF 
CVMFLCAAEWLTLGLmPLIA-^ffllWRYMSRPVMSGPGLYDPTTIMNADILAYCQKEGWCKLA 
FYLLAFFYYLYGMIYVLVSS 

Important features: 
Signal peptide: 

amino acids 1-20 

Type II transmembrane domsiin: 

amino acids 11-31 



Other transmembrane domain: 

amino acids 57-77 and 123-143 



F3CGURE 13 0 

ATTATAGCATTTGATGAGCTGAAGACTGATTACAAGATCCTATAGACCAGTGTAATACCCTG 
AATCCCCTTGTACTCCCAGAGTACCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGC 
AGCAGAGTGGCTTACACTGGGTCTCiiATATGCCCCTCTTGGCATATCATATTTGGAGGTATA 
TGAGTAGACCAGTGATGAGTGGCCG^GGACTCTATGACCCTACAACCATCATGAATGCAGAT 
ATTCTAGCATATTGTCAGAAGGAAGC3ATGGTGCAAATTAGCTTTTTATCTTCTAGCATTTTT 
TTACTACCTATATGGCATGATCTATGTTTTGGTGAGCTCTTAGAACAACACACAGAAGAATT 
GGTCCAGTTAAGTGCATGCAAAAAGCCACCAAATGAAGGGATTCTATCCAGCAAGATCCTGT 
CCAAGAGTAGCCTGTGGAATCTGATCIAGTTACTTTAAAAAATG 
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CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGG 
GAACAAGATGGCGGCGCCGAAGGGGi^GCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGC 
TGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGAC 
TCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACAC 
CTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTC 
AGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACA 
GAAGCATATTCCCAATCTGATGAGC^kATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCC 
ATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTC 
CTCTAACTCTGGTGAGGTCATTCTGGJAGTGACATGATGGACTCCGCACAGAGCTTCATAACC 
TCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCC 
AGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAA 
GCAAAATGTCCTATCTGCAAATGAG7LAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGA 
GAAAGTGATGGCTTTTTAAGATGCCT'CTCTCTTAACTCTGGGTGGATTTTAACTACAACTCT 
TGTCCTCTCGGTGATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGC 
AGTATGTTCCCTCTGAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAG 
CTAAACAGATATCCAGCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGA 
AGCAGGGCCTCTACCTACAAAAGTG^ATCTTGCTCATTCTGAAA TTTAA GCATTTTTCTTTT 
AAAAGACAAGTGTAATAGACATCTASAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCA 
TTGGATATAGGCCTTAAGAAATCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG 



< /us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA2 6847 
<siibunit 1 of 1, 323 aa, 1 stop 
<MW: 36223, pi: 5.06, NX(S/T}: 1 

MAAPKGSLWVRTQLGLPPLLLLTiyyVLAGGSGTASAEAFDSVLGDTASC3IRACQLTYPLHTYP 

keeelyacqrgcrlfsicqfvddg]:dlnrtklecesacteaysqsdeqyachlgcqnqlpfa 
elrqeqlmslmpkmhllfpltlvrsfwsdmmdsaqsfitsswtfylqaddgkivifqskpei 
qyaphleqeptnlresslskmsylqmrnsqahrnfledgesdgflrclslnsgwiltttlvl 
svmvllwiccatvataveqyvpseklsiygdlefmneqklnrypasslvvvrsktedheeag 

PLPTKVNLAHSEI 

Important features: 
Signal peptides 

amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site. 

amino acids 90-93 



FIGURE 133 



TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG 
TGGATGATGGAATTGACTTAAATCG.AACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC 
TGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAA 
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 



FIGURE 134 



cacactggccggatcttttagagtcctttgaccttgaccaagggtcnggaaaacagcaacaa 
gctgagctgctgtgacagagggaacaagatggcggcgccgaagggagcctttgggtgaggac 
ccaactggggctcccgccgctgctgctgctgaccatggccttggccggaggttcggggaccg 
cttcggctgaagcatttgactcggtcttgggtgatacggcgtcttgccaccgggcctgtcag 
ttgacctaccccttgcacacctaccctaaggaagaggagttgtacgcatgtcagagaggttg 
caggctgttttcaatttgtcagtttgtggatgatggaattgacttaaatcgaactaaattgg 
aatgtgaatctgcatgtacagaagcatattcccaatctgatgagcaatatgcttgccatctt 
ggttgccagaatcagctgccattcgctgaactgagacaagaacaacttatgtccctgatgcc 
aaaaatgcacctactctttcctcta;\ctctggtgaggtcattctggagtgacatgatggact 

CCGC 



FIGURE 135 



GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 
GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 
GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTG 
GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 
AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 
ACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAA 
ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 
GGAATGACATCAGCCAGTGTGGAAG.?iAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 
GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 
AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 
AGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGAC 
ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA 
CAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGAAGAAAACTGTTTTAAGCCAC 
AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT 
TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGG 
CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 
AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 
GGAGAAGGTCCAAGAAGGCTTAAGA^^CTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTT 
ATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 
AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 
CATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 
CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 
GTCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 
AAATTGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGA 
AATAGTATCATTATTCAACGCATTTGK3AAGAATTTCTACAAGTGTGAAAGAATTAGAAAACT 
TCAGGAACTTGTTACAGAATATTCAT TAAA GAAAACAAGCTGATATGTGCCTGTTTCTGGAC 
AATGGAGGCGAAAGAGTGGAATTTC/^TTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAA 
ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 
AAAAATTGTGTTAAGTCTATGTATA5.TACTACTGTGA6TAAAAGTAATACTTTAATAATGTG 
GTACAAATTTTAAAGTTTAATATTGJ^ATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 136 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53974 
<subunit 1 of 1, 468 aa, 1 stop 
<MW: 54393, pi: 5.63, NX(S/T): 2 

MGRGWGFLFGLLGAVWLLSSGHGEKQPPETilAQRCFCQVSGYLDDCTCDVETIDRFNNYRLF 
PRLQKLLESDYFRYYKVNLKRPCPE'WNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA 
KNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLIiNPE 
RYTGYKGPDAWKIWNVIYEENCFKI>QTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFY 
RLISGLHASINVHLSARYLLQETWLEKXWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLI 
ELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAH 
KLKEDFRLHFRNI SRIMDCVGCFKCRLWGKLQTQGLGTALKI LFSEKLIANMPESGPS YEFH 
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

Important features; 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site. 

amino acids 94-97 

Glycosaminoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class-V pyridoxal -phosphate 

amino acids 216-222 

Interleukin-7 proteins 

amino acids 338-343 



FIGURE 137 

GCTGGAAATATGGATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAAAGACCTT 
TAAATCCTTTGGCTTCTGGTCAAGGJGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAA 
GGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAA 
TGTGCATTTGAGTGCAAGATATCTITTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACA 
ACATTACAGAATTTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGG 
CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATT 
CTTNGAGCGCCCAGATTTTCAACTNTTTACTGGAAATAAAATTCAGGATGAGGNAAACAAAA 
TGTTACTTTTGGAAATACTTCATGAjaATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 
TTTTTTTGCTG 



FIGURE 138 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA 
AGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTCATACTGGACAGAAAC 
CGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGG 
TCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCCACCAGAAG 
CTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGC 
GCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGG 
GGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACTGGGAAATTCATCTC 
ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 
ATGGTGAGCTAAGGAGAGGGTGGTGGCAGTGTCTCTGAAGGTCCATAAAAGAAAAAAGAGAA 
GTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAA 
AGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCAACTGGGAGCATGTTCTGAGGGT 
GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 
CGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATATGTGCCCGTGTGGATGCTTCATTCCA 
GCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCCAGAGGAAGGCTGAGTACTTGGTT 
CCCAGAAGGAGATACTGGGTGGGAAZVAAGATGGGGCAAAGCG6TATGATGCCTGGCAAAGGG 
CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 
TTCAGACTCCTGGCCAGGTGTGGTGGCCCACACCTGTAATTCTAGCACTTTGGGAGGCCAAG 
GTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCAT 
CTCTACTAAAAAAAAAAAAATACAAiWlTTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTC 
ATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTG 
AGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAA 
AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 
CTCATGCCTGTAATCCCAACATTTTC5GGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAG 
GTTTGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 



FIGURE 13 9 

>< /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 7 0 3 9 
xsubunit 1 of 1, 124 aa, 1 stop 
xMW: 13352, pi: 5.99, NX(S/T): 1 

MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDVYRCPVGGAHNAPCAKIGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS 

Important features : 
Signal peptide: 

amino acids 1-22 

Cell attachment sequence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 

Integrins alpha chain proteins 

amino acids S7-81 



FIGURE 140 



CACAGTTCCCCACCATCACTCNTCCCATTCCTTCCAACTTTATTTTTAGCTTGCCATTGGGA 
GGGGGCAGGATGGGAGGGAAAGTGi!vAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 
TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 
CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 
ATTCCCAGGGCCACCAGAAGCTGAATTTGGATACAGTGTCTTACAACATGTT6GGGGTGGAC 
AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 
TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 
CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 
CAGATGGTGATGG 



FIGURE 141 

AAAGTTACATTTTCTCTGGAACTd^CCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGC:CCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACIAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 

acggccactgtgccatacaaccttc:gtgtcagggccacattgggctcacagacctcagcctg 
gagcatcctgaagcatccctttaatagaaactcaaccatccttacccgacctgggatggaga 
tcaccaaagatggcttccacctggttattgagctggaggacctggggccccagtttgagttc 

CTTGTGGCCTACTGGAGGAGGGAGC:CTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 

gggtattccagtgcacctagaaacc:atggagccaggggctgcatactgtgtgaaggcccaga 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 

ggagaggccattcccctggtactggccctgtttgcctttgttggcttcatgctgatccttgt 
ggtcgtgccactgttcgtctggaa^vatgggccggctgctccagtactcctgttgccccgtgg 

TGGTCCTCCCAGACACCTTGAAZ^-I'AACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 

ctcataggtttgcggaagggcccaggtgaagccgagaacctggtctgcatgacatggaaacc 

ATGAGGGGACAAGTTGTGTTTCTGITTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 

gcctgttgtctacaagtctagaagcaaccatcagaggcagggtggtttgtctaacagaacac 

TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 

ctgggaaaagtgacttcatcccttcggtcctaagttttctcatctgtaatgggggaattacc 

TACACACCTGCTAAACACACACACA.CAGAGTCTCTCTCTATATATACACACGTACACATAAA 

tacacccagcacttgcaaggctagagggaaactggtgacactctacagtctgactgattcag 

TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 

ggcttggagagcccactttcccagaataatccttgagagaaaaggaatcatgggagcaatgg 

TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 

aggtgacctggaggaaggtcacagccacactgaaaatgggatgtgcatgaacacggaggatc 

CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 

gtgcaatgcgacgagaatgcagaagtcagtaacatgtgcatgtttgttgtgctccttttttc 

TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 



</usr/ seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA57033 
<siibunit 1 of 1, 311 aa, 1 stop 
<MW: 35076, pi: 5.04, NX{,S/T): 2 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG 
GIPVHLETMEPGAAYCVKAQTFVKA.IGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILV 
WPLFVWKMGRLLQYSCCPWVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 23 0-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins, 

amino acids 92-119 

Integrins alpha chain proteins 

amino acids 232-262 



FIGURE 143 



TCCTGCTGATGCACATCTGGGTTTGiGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCA(3GCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGfATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGl'ACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACITGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCG6CCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 
ACATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 



FIGURE 144 



CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 
GGCAGGAGAG6AGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA 
GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAG 
GAGGAGATGCGGGATGGAGACCTGGiAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC 
GGAGAGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 
GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 
AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 
GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 
GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 
CAGGGGTCCCTCGGAGGCCTCCTGQ3 GATGG GGGCTGCAGCTCGTCTGAGCGCCCCTCGAGC 
GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 
ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 
CTGGTGAATGCAGCGTGGA6TCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 
GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 
AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 
GTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCT6CT 
GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 
AGGTGCAGCTCATTCACTTCAA.CCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 
CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 
CAGTCGCCTCCTTAACCGCGACACCilTCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 
TTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 
TCTCTCAGCACCCCGCCCTGCTCCGJ^GACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 
TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 
TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 
AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 
TGGTGTCCCCCATGGTCGCTGAGACI'CCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 
CCCACAAGGCGAGGGGAGTTACCCCT.AAAACAAAGCTATTAAAGGGACAGAATACTTA 



RTfflB 



FIGURE 145 



</usr/seqdb2/sst/DNA/Dna£!eqs .min/ ss .DNA34353 
<subunit 1 of 1, 328 aa, 1 stop 
<MW; 36238, pi: 9.90, NX(S/T): 3 

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLC 
AVGKRQSPVDVELKRVLYDPFLPPIjRLSTGGEKLRGTLYNTGRHVSFLPAPRPVWVSGGPL 
LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL 
FVNVASTSNPFLSRLLNRDT ITRI S YKNDAYFLQDLSLELLFPES FGFITYQGS LSTPPCSE 
TVTWILIDRAIiNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotxc-type carbonic anhydrases proteins 

amino acids 222-270, 128-164 and 45-92 



H 



arfff 



FIGURE 146 



GGCGCCTGGTTCTGCGCGTACTGGCTGTAC:GGAGCaGGAGCaAGAGGTCGCCGCCAGCCTCCGCCGCCGAGCCTC 

GTTCGTGTCCCCGCCCCTCGCTCCTGCAGCITACTGCTCAGAAACGCTGGGGCGCCCACCCTGGCAGACTAACGaA 

GCaGCTCCCTTCCCACCCCSUlCTGCAGGTC^TAATTTTGGACGCTTTGCCTGCCATTTCTTCCAGGTTGAGG^ 

CGCAGAGGCGGAGGCTCGCGTATTCCTGC^iGTCAGCACCCACGTCGCCCCCGGACGCTCGGTGCTCaGGCC^^^ 

GCGAGCGGGGCTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGGGCGGCTGCAGAGGCCGGCCGTCCGGTTTGGCT 

CACCTCTCCCAGGAAACTTCACACTGGAGa.GCaiAAAGGAGTGGAAGAGCCTGTCTTGGAGATTTTCCTGGGGAA 

ATCCTGAGGTCATTCATTMIGAAGTGTACCGCGCGGGAGTGGCTCAGAGTAACCACaGTGCTGTT^^ 

GCAATTCCAGCCATGGTGGTTCCCAATGCC:ACTTTATTGGAGA?iAOTTTTGGAAAAATACATGGATGAGGATG^^ 

GAGTGGTGGATAGCOy^CAACGAGGGAA^AGGGCCATCACAGACAATGACaTGCAGAGTATTTTG^ 

AATAAATTACGAAGTCAGGTGTATCCAACftGCCTCTAATATGGAGTATATGACATGGGATGTAGAGCTGGAAAGA 

TCTGCAGAATCCTGGGCTGAAAGTTGCTTGlTGGGAACATGGACCTGCAAGCTTGCTTCCaTCAATTGGACa.G^ 

TTGGGAGCACACTGGGQAAGATATAGGCCCCCGACGTTTCATGTACAATCGTGGTATGATGAAGTGAAAGACTTT 

AGCTACCCaTATGAAC:a.TGAATGCAACCCTiTATTGTCCATTCAGGTGTTCTGGCCCTGTATGTACAC:ATT^^ 

CAGGTCGTGTGGGCAACTAGTAACAGAATC:GGTTGTGCCATTAATTTGTGTCaTAACa.TGAACATCTGGGGGCAG 

ATATGGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCATGCCCCTTAC^^ 

CATGGGCGGCCCTGTTCTGCTTGCCCACCTAGTTTTGGAGGGGGCT6TAGAGAAAATCTGTGCTACAAAGAAGGG 

TCAGAC^lGGTATTATCCCCCTCGAGAAGAGGAAACAAATGAAATAGAACGACAGCAGTCACAAGTCCATGACA 

CATGTCCGQACAAGATCAGATGATAGTAGCAG2^TGAAGTCATAAGCGCaCAGC»LAATGTCCCS^ 

TGTQAAGTAAGATTAAGAGATCAGTGCaAS.GGAACAACCTGCAATAGGTACGAATGTCCTGCTGGCTGTTTGGAT 

AGTAAAGCTAAAGTTATTGGCAGTGTACATTATGAAATGCaATCCAGCATCTGTAGAGCTGCAATTCATTATGGT 

ATAATAGAO^TGATGGTGGOTGGGTAGATATO^CTAGACT^GGAAQAAAGCTlTTATTTCaTCaAGTCC^ 

AATGGTATTCAAACaATTGGCAAATATCAGTCTGCTAATTCCTTCACaGTCTCTAAAGTAACAGTTC^ 

ACTTGTGAAACAACTGTGGAACAGCTCTGTCCATTTCATAAGCCTGCTTCACATTGCCCAAGAGTATACTGTCCT 

CGTAACTGTATGCT^GCaAATCCACATTATGCTCGTGTAATTGGAACTCGAGTTTATTCTGATCTGTCCAGTA^ 

TGCAGAGCaGCAGTACATGCTGGAGTGGTTCGAAATCACGGTGGTTATGTTGATGTAATGCCTGTGGAaVAAA^ 

AAGACCTACATTGCTTCTTTTCAGAATGGAATCTTCTCAGAAAGTTTACAGAATCCTCCAGGAGGAAAGGCATTC 

AGAGTGTTTGCTGTTGTGSGAAACTGAATACTTGGAAGAGGACCATAAAGACTATTCCAAATGC?^TATTTCTGA 

ATTTTGTATAAAACTGTAACATTACTGTACaGAGTAOlTCT^CTATTTTCAGCCCAAAAAG^^ 

TAAATCTTGATAAACAAAGTCTATAAAATAAAACATGGGACATTAGCTTTGGGAAAAGTAATGAAAATATAATGG 

TTTTAGAAATCCTGTGTTAAATATTGCTATATTTTCTTAGCAGTTATTTCTACAGTTAATTACATAGTCaTGATT 

GTTCTACGTTTCATATATTATATGGTGCTTTGTATATGCCACTAATAAAATGAATCTAAACATTGAATGTGAATG 

GCCCTCAGAAAATCATCTAGTGCATTTAAA?^TAATCGACTCTAAAACTGAAAGAAACCTTATCACATTTTCCCC 

AGTTCSUlTGCTATGCCATTACCaACTCCaA^TAATCTCa^TAATTTTCa^CTTAATAACT^ 

TGTTAATTTAGGCATATAGAATATTAAATTCTGATATTGCACTTCTTATTTTATATAAAATAATCCTTTAATATC 

CAAATGAATCTGTTAASlATGTTTGATTCCTTGGGAATGGCCTTAAAAATAAATGTAATAAAGTCAQAGTGGTGGT 

ATGAAAACATTCCTAGTGATCATGTAGTAA^TGTAGGGTTAAGCATGGACaGCCAGAGCTTTCTATGTACTGTTA 

AAATTGAGGTCACATATTTTCTTTTGTATCCTGGCAAATACTCCTGCAGGCCAGGAAGTATAATAGCAAAAAGTT 

GAACy^GATGAACTAATGTATTACATTAC(:::ATTGCCa^CTGATTTTTTTTAAATGGTAAATGACCrTGTATATAA 

ATATTGCCATATCATGGTACCTATAATGGTI3ATATATTTGTTTCTATGAAAAATGTATTGTGCTTTGATACTAAA 

AATCTGTAAAATGTTAGTTTTGGTAATTTTTTTTCTGCTGGTGGATTTACATATTAAATTTTTTCTGCTGGTGGA 

TAAACATTAAAATTAATCATGTTTCAAAAAi!iAAAAAAA 



FIGURE 147 



</usr/seqdb2/sst/DNA/Dnasegs.min/ss.DNA45417 
<subunit 1 of 1, 500 aa, 1 stop 
<MW: 56888, pi: 8.53, NX(S/T): 2 

MKCTAREWLRVTTVLFMARAIPAiyn/VPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDN^ 

QSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNLGAHW 

GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLC 

HNMNIWGQIWPKAVYLVCNYSPKGN-WWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 

YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNR 

YECPAGCLDSKAKVIGSVHYEMQSSICRAAIHYGIIDNDGGWVDITRQGRKHYFIKSNRNGI 

QTIGKYQSANSFTVSKVTVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG 

TRVYSDLSSICRAAVHAGWRNHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRV 

FAW 

Important features : 
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 protein 

amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 

amino acids 28-31 



FIGURE 148 



GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 
CAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 
GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCATGTCCCTGCTCCCACGCCG 
CGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 
ACACCGCGCGTGTGGACGGGTCCA^iATGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 
AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT 
CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 
AGAGCACCAAGCGCTTCATCAAGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAA 
GAATAGGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 
CTTTGCAGATTAAAAAAAAAAAAAAAAAAAAAAAAiyiAAAAAAAAAAAAAAAAAAGCCTTTC 
TTTCTCACAGGCATAAGACACAAATTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 
TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 
CCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAA 
AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 
GGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 
TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 
GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAA6CAGGAGCAGGTCTCT 
CTGCATCTGTTCTCTGAGGAACTCAi\GTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 
GGTTAATTTTTACACACCCTAGGAA2\CATTTCCAAGATCCTGTGATGGCGAGACAAATGATC 
CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 
TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 
TAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTGACAAGGGAA 
AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 
TTGTGATTCCCCATGTAATTCTTCAJiTGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 
GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 
CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 
TGCATGCAGGTTTCATATTCTTTCT/AGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 
AAAAAAAAAAA 



FIGURE 149 



MSLLPRRAPPVSMRLIAAALLLLLTjALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPH 
CEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE 

Signal sequence: 

amino acids 1-34 



FIGURE 150 



gccccagggactgctatggcttcctttgttgttcaccccggtctgcgtcatgttaaactccaatgtcctcctgtg 

gttaactgctcttgccat(:y^gttcaccctcattgacagcc3^gc:a.cagtatcc^ 

caaiu^tccggggcctaagaacacasttacca^tgagatcttgggtccagtggagcagtacttaggc^ 

tgcctcaccccccactggagagaggcggtttcagcccccagaacccccgtcctcctggactggcatccgaaatac 

tactcagtttgctgctgtgtgcccccagcacctggatgagagatccttactgcatgaawrgctgcccatctggtt 

taccgccaatttggatactttgatgacctatgttcaagatcaaaatgaagactgcctttacttaaacatctacgt 

gcccacgqaagatggagco^cacaaagaaaaacgcagatgatataacgagtaatgaccgtggtg^ 

tattcatgatcagaacagtaagaagcccgtovtggtctatatccatgggggatcttaa^tggagggcaccggc^ 

c^ltgattgacggcagcattttggcaagotacggaaacgtaktcgtqa^ 

agggtttttaagtaccggtgaccaggcago^aaaggcaactatgggctcctggatcagatta^gcaot 

gattgaggagaatgtgggagcctttggcgg<3gaccccaagagagtgaccatctttggctcgggggctggggcctc 

ctgtgtc:agcctgttgacccrgtcccactaotca.gaaggtctcttcc:agaaggccatcattc^ 

cctgtccagctgggcagtgaactaccagcosgccaagtacactcggatattggct^gacy^g^ 

gctggacaccacggacatggtagaatgcctgcggaacaagaactacaaggagctcatccag 

ggccacctaccacatagccttcgggccggt<3atcgacggcgacgtcatcccagacgacccccagatcctgatgga 

go^ggcgagttcctcaactacgacatcatgctgggcgtcaaccaaggggaaggcctgaagttcgtg^ 

cgtggataacgagqacggtgtgacgcccaacgactttgacttctccgtgtccaacttcgtggacaacctttacgg 

ctaccctgaagggaaagacactttgcgggacsactatcaagttcatgtacacagactgggccgataaggaa^ 

ggagacgcggcggaaaaccctggtggctctctttactgaccaccagtgggtggcccccgccgtggccgccgacct 

gcacgcgcagtacggctcccccacctacttctatgccttctatcatcactgccaaagcgaaatgaagcccagctg 

ggcagattcggcccatggtgatgaggtcccc:tatgtcttcggcatccccatgatcggtcccaccgagctcttcag 

ttgtaacttttccaagaacgacgtcatgctcyvgcgccgtggtcatgacctactggacgaacttcgcct^aaactot 

tgatccaaatcs^ccagttcctcaggataccjagttcattcaoicaaaacccs^ 

gtcgaagtataatcccaaagaccagctctatctgcatattggcttqaaacccaqagtgagagatcactaccgggc 

aacgaaagtggctttctggttggaactcgttcctcatttgcacaacttgaacgagatattccagtatgtttc^ 

aaccacaaaggttcctccaccagacatgacjitcatttccctatggcacccggcgatctcccgccaagrtatggcc 

aaco^ccs^aacgcccagct^tcactcctgccauvct^tcccaaacactctaaggaccctc^ 

ggao^ay^ctgtcctcattgaaaccaaacxsagattattccaccgaattaagtgtcacaittgccgtcggg^ 

GCTCCTCTTCera^CATCTTAGCTTTTGCCXX:GCTGTACTACaAAAAGGACyUlGAGGCGCCSkTGAGACT 

GCGCCCCaiGTCCCCAGAGAAACACCACAAATGATATCGCTCACATCCyiGAACGAAGAGATCATGTCTCTGC^ 

GAAGCAGCTGGAACACGATCACGAGTGTGAGfTCGCTGCAGGCACACGACACACTGAGGCTCACCTGCCCGCG?^ 

CTACACCOTCACGCTGCGCCGGTCGCCAGATGACATCCCACTTATGACGCCaAACACCATCACCA 

C3l(:a.CTGACGGGGATGCaGCCTTTGaVCACTTTTAACACCTTCa^GTGGAGGACAAAACAGTAC^^ 

CGGACaTTCCACCACTAGAGT ATAG CTTTGC!CCTATTTCCCTTCCTATCCCTCTGCCCTACCCgCTCAGCAACAT 

AGAAGAGGGAAGGAAAGAQAGAAGGAAAGAGaGAGAGRAAGftAAGTCTCa^GACCAGGAATGTTTTTGTCCCaCT 

GACTTAAGACAAAAATGCAAAAAGGCAGTCATCCCSITCCCGGCAGACCCTTATCGTTGGTGTTTT^ 

AAGATCAACTTCTGACCCTGTGAAATGTGAGAAGTACACATTTCTGTTAAAATAACTGCTTTAAGATCTCTACCA 

CTCCA&TCAATGTTTAGTGTGATAGGACaTCACCATTTCaAGGCCCCGGGTGTTTCCAACGTCATGG 

GAGACTTCTGAAACTCAGCCAAGGACACTTGATAlT'TTTTAATTAG!\ATGGAAGTTTAZUi.^^ 

ai.C».CyATGGATGGCTCTCCTTAAGTGAAGAAAGAGTCaATGAQATTTTGCCCaGCACATGGAGCTGTAATC{^ 

AGAGAAGGAAACGTAGAAATTTATTATTAiy^iAGAATGGACTGTGCAGCGAAATCTGTACGGTTCTGTGCAAAGAG 

GTGTTTTGCCAGCCTGAACTATATTTAAGAG&CTTTGT 



FIGURE 151 

MIjNSlSnn^LWLTAnA.IKFTLIDSQAQYPVVNTNYGKIRGLRTPLPNEILGPVEQYLGVPyASP 

PTGERRFQPPEPPSSWTGIRNTTQF.?^VCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQN 

EDCLYm I YVPTEDGANTKKNADDI TSNDRGEDED I HDQNS KKP VMVY IHGGS YM 

DGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVGAFGGDPKR 

VTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILADKVGCNML 

DTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLlSrYDIMLGV 

NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENP 

ETRRKTLVALFTDHQWVAPAVAADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 

FGIPMIGPTELFSCNFSKNDVMLSA'iATMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVA 

WSKYNPKDQLYLHIGLKPRVRDHYRJITKYAFWLELVPHLHNLNE I FQYVSTTTKVPPPDMTS 

FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTEIiSVTIAVG 

ASLLFLNILAFAALYYKKDKRRHETIIRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 

SLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTN 

LPHGHSTTRV 

Signal sequence: 

amino acids 1-24 



Transmembrane domains; 

amino acids 189-204, 675-692 



gggaaagatggcggcgactctgggaccccttgggtcgtggcagcagtggcggcgatgtttgt 
cggctcgggatgggtccaggatgttactccttcttcttttgttggggtctgggcaggggcca 
cagcaagtcggggcgggtcaaacgttcgagtacttgaaacgggagcactcgctgtcgaagcc 
ctaccagggtgtgggcacaggcag']?tcctcactgtggaatctgatgggcaatgccatggtga 
tgacccagtatatccgccttacccciagatatgcaaagtaaacagggtgccttgtggaaccgg 
gtgccatgtttcctgagagactgggagttgcaggtgcacttcaaaatccatggacaaggaaa 
gaagaatctgcatggggatggcttck3caatctggtacacaaaggatcggatgcagccagggc 
ctgtgtttggaaacatggacaaati'tgtggggctgggagtatttgtagacacctaccccaat 
gaggagaagcagcaagagcgggtattcccctacatctcagccatggtgaacaacggctccct 
cagctatgatcatgagcgggatggcjcggcctacagagctgggaggctgcacagccattgtcc 
gcaatcttcattacgacaccttcctggtgattcgctacgtcaagaggcatttgacgataatg 
atggatattgatggcaagcatgagtggagggactgcattgaagtgcccggagtccgcctgcc 
ccgcggctactacttcggcacctcc:tccatcactggggatctctcagataatcatgatgtca 
tttccttgaagttgtttgaactgacagtggagagaaccccagaagaggaaaagctccatcga 
gatgtgttcttgccctcagtggacaatatgaagctgcctgagatgacagctccactgccgcc 

CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 
TAGTCATTGGTATCATACTCTACA5.CAAATGGCAGGAACAGAGCCGAAAGCGCTTCTACTGA 
GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 
GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 
TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATCTAACT 
CTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 
TTCCATGTGGGAGCAGAGGTGTGAR.GAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 
AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 
AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 
TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 
TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 
GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 
TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 
GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 
AAGTTTGGCTAAAGGTTGGTGTAAAiaATCAAGAGAAGCCTGGAAGACATCATGGATGCCATG 
GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTG 
GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCT6CATGTTTTGTAGT 
TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 
GGGCTTGGATATTGCCCAGAGAAGAJ^TTTGGCTTTTTTTTTCTTAATGGACAAGAGACAGT 
TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 
CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 
TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCAG 
AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGCCTAAATGAAAT 
TCTTAAAATCACCGATGGAACCAAAi\AAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 



FIGURE 153 



X / us r / seqdb2 / s s t /DNA/Dna seqs.min/ss- DMAS 0911 
xsubunit 1 of 1, 348 aa, 1 stop 
xMW: 39711, pi: 8-70, NX(S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSEMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQ 

GVGTGSSSLWNLMGNJyyrmTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQGKKN 

LHGDGIAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFPYISJyyiVNN 

DHERDGRPTELGGCTAIVRlJLHYDI'FLVIRyVKRHLTIMMDIDGKHEWRDCIEVPGWLPRG 

YYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLS 

GLALFLIVFFSLVFSVFAIVIGIIKmKWQEQSRKRFY 

Signal secnience; 

amino acids 1-38 

Transmembrane domain: 

amino acids 310-329 



FIGURE 154 



CCGAGCCGGGCGCGCAGCGACGGAGCTGGCJGCCGGCCTGGGACCATGGGCGTGAGTGCAATCTACGGATCAGTCT 

CTGATGGTGGGTCGTTAACCTCAGTGGGG^^CTCCaAGATTTCCATGAAGAAAATCAGTTGTCTT(^^ 

TGGGGTCTGGCTCAGAATTCCTGCaGCTGCSTGAAAATCTGTTTTCTAGAAGAGGTTTAATTjy^TGCOTGCAGTCT 

GACaTGTTCCCGATTTGAGGTGAAACOiTCSftAGAGAAAATAGAATACTTAATliATGCrTT^^ 

TGCTGCTGCTGGCCCTGGCTGCGCTGCTGGJCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGATCCCGGTGT 

CGACTCCTAAGAATGGAATGAGTAGCAAGPvGTCGAAAGAGAATCATGCCCGACCCTGTGACGGAGCCCCCTGTGA 

CAGACCCCGTTTATGAAGCTCTTTTGTACTGCAACATCCCCAGTGTGGCCGAGCGCAGCATGGAAGGTCAT^ 

CGCATCa.TTTTAAGCTGGTCTCAGTGCATGTGTTCATTCGCCACGGAGACAGGTACCCACTGTATGTCATTCCCA 

AAACaAAGCGACCAGAAATTGACTGCACTCTGGTGGCTAACAGGAAACCGTATCACCCAAAACTGGAAGCTTTC^ 

TTAGTCACATGTCAAAAGGATCCGGAGCCTCTTTCGAAAGCCCCTTGAACTCOTTGCCTCTTTACCCAAATCACC 

CATTGTGTGAQATGGGAGAGCTa^CACAGa.CAGGAGTTGTGCAGCATTTGCAGAACGGTCaVGOT 

TCTATCTAAAGAAACacy^AACTCCTGCCCSATGATTGGTCTGCAGACCAGCTCTATTTAGAGACC^ 

GCCGGACCCTACAAAGTGGGCTGGCCTTGCITTTATGGCTTTCTCCCAGATTTTGACTGGAAGaAGATTTATTTCA 

GGCACCAGCCAAGTGCGCTGTTCTGCTCTGIGAAGCTGCTATTGCCCGGTAAGaAACCAGTATCTGGAAAAGGAGC 

AGCGTCGTCAGTACCTCCTACGTTTGAAAAACAGCCAGCTGGAGAAGACCTACGGGGAGATGGCCAAGATCGTGG 

ATGTCCCCyiCCaAGCmGCTTAGAGCTGCCaACCCCATAGACTCCa^TGCTCTGCCACTTCTGCCAa^TGTC^ 

TTCCCTGTACCAGAAATGGCTGTGTTGACATGGAGCACTTCaAGGTAATTAAGACCCaTCAGATCGAGG^ 

GG6AAAGACGGGAGAAGAAATTGTACTTCGGGTATTCTCTCCTGGGTGCCCACCCCATCCTGAACCAAACCATCG 

GCCGGATGCAGCGTGCCaCCGAGGGCaGGAAAGAAGAGCTCTTTGCCCTCTACTCTGCTCATGATGTCACTCTGT 

CACCAGTTCTO^GTGCCTTGGGCCTTTCAGAAGCCAGGl^CCCAAGGTTTGCAGCCAGGTTGA^ 

GGCAAGACAGAGAAAAGCC<:».GTGAACATTCCGTCCGGATTCTTTACAATGGCGTCGATGTCACaTTCC^ 

CTTTCTGCCAAQACCa.CCACaAGCGTTCTCCCAAGCCCS.TGTGCCCGCTTGA2UU^CTTGGTCCGCT^ 

GGGAOVTGTTTGTAGCCCTGGGTGGCAGTGGTACSU^TTATTATGATGCATGTCACAGGGAAGGATTCT^ 

TATGCAGTACAGC:a.GTATAGAATCCATGCCAATACAGAGCATAGGGAAAGGTCCACTTCTAGTTTTGTCTGTTAC 

TAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCACAGATGGTTGGGGTTGAACAGT 

AAGa^CaVTTGCTGCAATGTGGTACGTGAATTGCTTGGTAOUUU^TGGCCAGTTCACAGAGG^ 

TATCATAGCCAGACTTCGCTTAGAATGCCAGAATAATATAGTTCAAGACCTGAAGTTGCCAATCCAAGTTTGCAC 

TCTTCTGGCCTGCCCCATGTTACTATGTGATGGAACCAGCa^a^CCTCaACCZWU^TTTTTTTAATCTTAGA^ 

TTTACCTTGTCCTTGTTAAGAATTTCTTGAAGTGATTTATCTAAAATAAAGGTTGGCAAACTTTTTCTGTAAAGG 

6CCAGATTGTAAATATTTCAGACTGTGTGGACC?iAAAGGCCACATACAGTCTCTGTCATAACTA 

TTCTGAAGCAGGAAAGCCACCACAGACAGTACATAAAGGAATATGTGTAGCTGGGTTCCaVGGCCAGACAAAAC^ 

GATGGTGACCAGACTTGGCCCCTGGGCTGTAGTTTGCTGACCCCTCATCTAAAAAATAGGCTATACTACAATTGC 

ACTTCCa.GCACTTTGAGAACGAGTTGAATACCAAGAATTATTCAATGGTTCCTCCAGTAACTTCTGCTAG3yiACa 

Ca.GAATTTGGTCTGTATCTGACACTAGAAa\AAACTTGAGGGTAAATAAACATTGAATTAGAATGAATCATAGAA 

AACTGATTAGAAGAATACTTGATGTTTATffi\TGATTGTGGTACAAGATAGTTTTAAGTATGTTCTAAATATTTGT 

CTGCTGTAGTCTATTTGCTGTATATGCTGAiiATTTTTGTATGCCATTTAGTATTTTTATAGTTTAGGAAAATATT 

TTCTAAGACCAGTTTTAGATGACTCTTATTCCTGTAGTAATATTCAATTTGCTGTACCTGCTTGGTGGTTAGAAG 

GAGGCTAGAAGATGAATTCAGGCACTTTCTTCCAATAAAACTAATTATGGCTCATTCCCTTTGACy^GCTGTA 

ACTGGATTCATTTTTAAACCATTTTCATCAGTTTCAAATGGTAAATTCTGATTGATTTTTAAATGCGTTTTTGGA 

AGAACTTTGCTATTAGGTAGTTTACAGATCTTTATAAGGTGTTTTATATATTAGAAGCAATTATAATTACATCTG 



AACTTTTTCTCTTTGTTTTTGTCCAGTGTTGfCATTTGAATATGTCTGTTTCTATAAATAAATTTTTTAAGAATAA 



FIGURE 155 

></usr/seqdb2/sst/DNA/Dna£!eqs.min/ss.DNA48329 
xsvibunit 1 of 1, 480 aa, 1 stop 
xMW: 55240, pi: 9.30, NX(S/T): 2 

MLFRNRFLLLLALAALLAFVSLSLQFFHLIPVSTPKNGMSSKSRKRIMPDPVTEPPVTDPVY 
EALLYCNIPSVAERSMEGHAPHHFK]jVSVHVFIRHGDRYPLYVIPKTKRPEIDCTLVANRKP 

yhpkleafishmskgsgasfesplnslplypnhpiicemgeijtqtgvvqhlqngqllrdiylk 
khkllpndwsadqlylettgksrtlqsglallygflpdfdwkkiyfrhqpsalfcsgscycp 
vrnqylekeqrrqyllrlknsqlektygemakivdvptkqlraanpidsmlchfchnvsfpc 
trngcvdmehfkvikthqiedererijekklyfgysllgahpilnqtigrmqrategrkeelf 
alysahdvtlspvlsalglsearfprfaarlifelwqdrekpsehsvrilyngvdvtfhtsf 
cqdhhkrspkpmcplenlvrfvkrdmfvalggsgtnyydachregf 

S ignal s equenc e : 

amino acids 1-18 



FIGURE 156 

AAAAAAGCTCACTAJ^GTTTCTATTAC3AGCSAATACGGTAC3ATTTCCATCCCCTTTTGAAGAACAGTACTC^ 

GCTATTTAAGAGATAAAAACGAAATATCCTTTCTGGGAGTTCAAGATTGTGCAGTAATTGGTTAGGACTCTGAGC 

GCCGCTGTTCACCAATCGGGGAGAGAAAAGCGGAGATCCTGCTCGCCTTGCftCGCGCCTGAAGCACAAAGCAGAT 

AGCTAGQAATGAACCATCCCTGGGAGTATGTGGAAACAACGGAGGAGCTCTGACTTCCCAACTGTCCCATTCTAT 

GGGCGAAGGAAOTGCTCCTGACTTCaGTGGTTAAGGGCAGAATTGAAAATAATTCTGGAGGAAGATAAGAATGAT 

TCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGACTCTGTGGGAGACCGG 

ATGO^CCCaGATACGCTATTCAGTTCCGGAi^GAGCTGGAGAAAGGCTCTAGGGTGGGCGACATCTCCAGGGACCT 

GGGGCTGGAGCCCCGGGAGCTCGCGGAGCGCGGAGTCCGCATCATCCCCAQAGGTAGGACGCAGCTTTTCGCCCT 

QAATCCGCGCAGCGGCAGCTTGGTCACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCCATCAAGTG 

TCAATTAAATCTAGACATTCTGATGGAGGATAAAGTGAAAATATATGGAGTAGAAGTAGAAGTAAGGGACATTAA 

CGACAATGCGCCTTACTTTCGTGAAAGTGAi^TTAGAAATAAAAATTAGTGAAAATGCAGCCACTGAGATGCGGTT 

ccctctacccc».cgcctgggatccggatat<:gggaagaactctctgcagagctacgagctc:agcccgaacactca 

CTTCTCCCTCATCGTGCAAAATGQAGCCGACGGTAGTAAGTACCCCGAATTGGTGCTGAAACGCGCCCTGGACCG 

cgaagaaaaggctgctcaccacctggtccttacggcctccgacgggggcgacccggtgcgcacaggcaccgcgcg 

CATCCGCGTGATGGTTCTGGATGCGAACGACAACGCACCAGCGTTTGCTCAGCCCGAGTACCGCGCGAGCGTTCC 

GGAGAATCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACGCTACCGACCCTGACGAAGGAGTCAATGCGGAAGT 

GAGGTATTCCTTCCGGTATGTGGACGACAAC5GCGGCCCAAGTTTTCAAACTAGATTGTAATTCAGGGACAATATC 

AACAATAGGGGAGTTGQACCACGAGQAGTa\GGATTCTACCAGATGQAAGTGCAAGCAATGGATAATGCAGGATA 

TTCTGCGCGAGCCAAAGTCCTGATCa.CTGTTCTGGACGTGAACGACAATGCCCCAGAAGTGGTCCTCACCTCTCT 

CGCCAGCTCGGTTCCCGAAAACTCTCCCAGi^GGGACATTAATTGCCCTTTTAAATGTAAATGACCAAGATTCTGA 

GGAAAACGGACAGGTGATCTGTTTCATCCA^IGGAAATCTGCCCTTTAAATTAGAAAAATCTTACGGAAATTACTA 

TAGTTTAGTCACAGACATAGTCTTGGATAGCK3AACAGGTTCCTAGCTACAACATCACAGTGACCGCCACTGAC 

GGQAACCCCGCCCCTATCCACGGAAACTCAIiATCTCGCTGAACGTGGCAGACACCAACGACAACCCGCCGGTCTT 

CCCTCAGGCCTCCTATTCCGCTTATATCCa^GAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGACCGCCCA 

CGACCCCGACTGTGAAGAGAACGCCCAGATCACTTATTCCCTGGCTGAGAACACCATCCAAGGGGCAAGCCTATC 

GTCCTACGTGTCCATCAACTCCQACACTGGGX3TACTGTATGCGCTGAGCTCCTTCGACTACGAGCAGTTCCGAGA 

CTTGCaAGTGAAAGTGATGGCGCGGGAGRAC:GGGCACCCGCCCCTCAGCAGa^CGTGTCGTTGAGC^ 

GCTGGACa^GAACGACAATGCGCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGQAGCT 

GGCTCCCCGCTCCGCAGAGCCCGGCTACCTC^GTGACCaAGGTGGTGGCGGTGGACAGAGACTCCGGCCAGAACGC 

CTGGCTGTCCTACCGTCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCTGCACACGGGCGAGGTGCG 

CACGGCGCGAGCCCTGCTGGACaGAGACGCGCTCAAGCaGAGCCTCGTAGTGGCCGTCCAGGACCACGGCCAGCC 

^^S^^?"S^^^^^*=^'=^GTCACGCTCACCGTG!GCCGTGGCCGACAGCATCCCCC^^ 

CCTCGAGTCTCCAGCTAACTCTGAAACCTC^.QACCTCACTCTGTACCTGGTGGTAGCGGTGGCCGCGGTCTCCTG 

^I^^®^^^"^'^'^°^^®^^'^°^*=^^«'rCGCACTTTGTGGGCGTGGACGGGGTGCAGGCTTTC 
S^™S^?^^^®5TTTCCCTCACCaCGGACTCGCGGAAGAGTC».CCTGATCTTCCCCCA^ 

CATGCTCGTCAGCCAGGAGAGCTTTGAAAAftAGCQAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAaAGA 

CAGTCATGGGTTAATTGAGGTGAGTTTATATCAAATCTTCTTTCTTTTTTTTTTTAATTGCTCTGTCTCCCAAGC 

^^^^^^^^^^■^^^^■^^■^^^^'^^^G'^^CCTCAAACTCCTAGGCTCAAGC^ 

SS??^S?'^*^^5^C:TACAGGTGCAAGCra.CCTACTGTCTGCCTATCTATC^^^ 

™^^^^^^^^^^"^^^*^^^'^^^^^^'^'==^'rG'^'ACAGACGGGAGTCTCACGCCTGTA^^ 

CGAGGCGGGTGGAracCTGAGGTTGGGAGTTTGAGACCAGCCTGACCAACATGGAGAAACCCCGTCTATACTAA 
AAAAATACAAAATTAGCCGGGCGTGGTGGTGCATGTCTGTAATCCCAGCTACTTGGGAGGCTGAGTCAGGAG^ 
i^^?^ ^^^'^^^^^'^^^^'^^'^■^^^^CTGAGATTGTGCCy^TTGCACTC^ 
AAACTCTATCTCA 



FIGURE 157 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48306 
><st±>unit 1 of 1, 916 aa, 1 stop 
xMW: 100204, pi: 4.92, NX(S/T): 4 

MIPARLHRDYKGLVLLGILLGTLWKTGCTQIRYSVPEELEKGSRVGDISRDLGLEPREIiAER 

GVRIIPRGRTQIiFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDIIiMEDKVKIYGVEVEVR 

DINDNAPYFRESELEIKISEN.2y^TEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGA 

DGSKYPELVLKRALDREEKAAHHL\rLTASDGGDPVRTGTARIRVMVLDANDNAPAFAQPEYR 

ASVPENLALGTQLLVVNATDPDEGVNAEVRYSFRYVDDKAAQVFKLDCNSGTISTIGELDHE 

ESGFYQMEVQAMDNAGYSARAKVLn:TVLDWDNAPEVVLTSLASSVPENSPRGTLIALr^rV^ 

DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPIiST 

ETHISIJSrVADTNDNPPVFPQASYSAYIPENNPRGVSLVSWAHDPDCEENAQITYSIJ^OT 

QGASLSSYVSINSDTGVLYALSSFDYEQFRDLQVKVMARDNGHPPLSSNVSLSLFVLDQNDN 

APEILYPALPTDGSTGVELAPRSAEPGYLVTKWAVDRDSGQNAWLSYRLLKASEPGLFSVG 

LHTGEVRTARALLDRDALKQSLWA.VQDHGQPPLSATVTLTVAVADSIPQVLADLGSLESPA 

NSETSDLTLYLWAVAAVSCVFLAF7ILLLALRLRRWHKSRLLQASGGGLTGAPASHFVGVD 

GVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL 

I EVSLYQI FFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS 

lYLSIYLSIYLSIYLLLSCTDGSLTPVIPVLWEAEAGGSPEVGSLRPA 

Signal sequence: 

amino acids 1-30 

Transmembrane domains: 

amino acids 693-711, 809-823, 869-888 



FIGURE 158 



CCCAGGCTCTAGTGCAGGAGGAGAi!kGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAG 
GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 
TCAGTAGGTGACCCCGCCGCTGGATTCTGGAAGACCTCACCATGGGACGCCCCCGACCTCGT 
GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 
ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 
CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 
ACAGCTGCCCACTGTAAAAAACCG^^TACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
TAAAGATGGCCCAGAGCAAGAAATA.CCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACA 
GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCJs.GCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGT6ATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
ATAGGCAGCAAGGGCTGATTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCT 
CTGGTTC 



FIGURE 159 



</usr/segdb2/sst/DNA/Dnaseqs.min/ss.DNA48336 
<subunit 1 of 1, 260 aa, 1 stop 
<MW: 28048, pi: 7.87, NX(S/T): 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVL 
VGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPWQS I PHPCYNSSDVEDHNHDLMLL 
QLRDQASLGSKVTCPISIJyDHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCED 
AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 51-71 

N-glycosylation site, 
amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site. 

amino acids 182-188 

Kringle domain proteins motif 

amino acids 205-217 



FIGURE 160 



GGCGCCGGTGCACCGGGCGGGCTGA(3CGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 
CGCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGC 
GCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCG 
CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAAC 
ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 
CCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGC 
GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 
TGCGGACCCGGCCGCGGGGAGACGGCJCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 
CCCGCCCAACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 
CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 

gcccaaggtgacgacaagctgccccciagcagggcctgcaggctgtgcccgtgggcatccctg 
ctgccagccagcgcatcttcctgcac:ggcaaccgcatctcgcatgtgccagctgccagcttc 
cgtgcctgccgcaacctcaccatcctgtggctgcactcgaatgtgctggccc6aattgatgc 
ggctgccttcactggcctggccctcctggagcagctggacctcagcgataatgcacagctcc 
ggtctgtggaccctgccacattccacggcctgggccgcctacacacgctgcacctggaccgc 
tgcggcctgcaggagctgggcccggggctgttccgcggcctggctgccctgcagtacctcta 
cctgcaggacaacgcgctgcaggcactgcctgatgacaccttccgcgacctgggcaacctca 
cacacctcttcctgcacggcaaccgciatctccagcgtgcccgagcgcgccttccgtgggctg 
cacagcctcgaccgtctcctactgcaccagaaccgcgtggcccatgtgcacccgcatgcctt 
ccgtgaccttggccgcctcatgacagtctatctgtttgccaacaatctatcagcgctgccca 
ctgaggccctggcccccctgcgtgccctgcagtacctgaggctcaacgacaacccctgggtg 
tgtgactgccgggcacgcccactctcjggcctggctgcagaagttccgcggctcctcctccga 
ggtgccctgcagcctcccgcaacgcc:tggctggccgtgacctcaaacgcctagctgccaatg 
acctgcagggctgcgctgtggccaccggcccttaccatcccatctggaccggcagggccacc 
gatgaggagccgctggggcttcccaijlgtgctgccagccagatgccgctgacaaggcctcagt 
actggagcctggaagaccagcttcggcaggcaatgcgctgaagggacgcgtgccgcccggtg 
acagcccgccgggcaacggctctggcccacggcacatcaatgactcaccctttgggactctg 
cctggctctgctgagcccccgctcactgcagtgcggcccgagggctccgagccaccagggtt 
ccccacctcgggccctcgccggaggccaggctgttcacgcaagaaccgcacccgcagccact 
gccgtctgggccaggcaggcagcggg'rggtggcgggactggtgactcagaaggctcaggtgcc 
ctacccagcctcacctgcagcctcacccccctgggcctggcgctggtgctgtggacagtgct 
tgggccctgctgacccccagcggacacaagagcgtgctcagcagccaggtgtgtgtacatac 
ggggtctctctccacgccgccaagcgagccgggcggccgacccgtggggcaggccaggccag 
gtcctccctgatggacgcctgccgcc!cgccacccccatctccaccccatcatgtttacaggg 
ttcggcggcagcgtttgttccagaac:gccgcctcccacccagatcgcggtatatagagatat 
gcattttattttacttgtgtaaaaatatcggacgacgtggaataaagagctcttttcttaaa 

AAAA 



FIGURE 161 



></usr/seqdb2/sst/DNA/Dnaseqs .rain/ss .DNA44184 
xsubunit 1 of 1, 473 aa, 1 stop 
xMW: 50708, pi: 9.28, NX(S/T): 6 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAP 
LRALQYLRLNDNPWCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGIiPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAirRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features : 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site. 

amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 



GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 
TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 
AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAATGGAGGCCAGCGGGA 
AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 
GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 
CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG 
TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 
GAGAAATTGGACCGTGAGGATCTGTGCGGTCACACAGAGCCCTGTGTGCTACGTTTCCAAGT 
GTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACC 
ACTCTCCAGTATTTCTGGACAAACAAATGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGG 
ACTACGTTTCCTCTGAAGAATGCCG?^GACTTAGATGTAGGCCAAAACAATATTGAGAACTA 
TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 
ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 
CTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGT 
CCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 
AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC 
AACGGAGAGATTTCCTATTCACTTTTCCAAGCTTCAGAAGAGATTGGCAAAACCTTTAAGAT 
CAATCCCTTGACAGGAGAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCT 
ATGAAGTCAATATTGAGGCAAGA6ATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTGATT 
CAAGTGATAGATGTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAAT 
ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 
AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 
AACTTTTACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACAT 
CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 
TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 
CGCGAGAACAACAGCCCCGCCCTGCI^CATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 

caccaacgcccaggtcacctactcgc:tgctgccgccccaggacccgcacctgcccctcacat 

CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 

gccctgcaggggttccagttccgcgtgggcgcttcagaccacggctccccggcgctgagcag 
cgaggcgctggtgcgcgtggtggtgc:tggacgccaacgacaactcgcccttcgtgctgtacc 
cgctgcagaacggctccgcgccctgc:accgagctggtgccccgggcggccgagccgggctac 
ctggtgaccaaggtggtggcggtggacggcgactcgggccagaacgcctggctgtcgtacca 
gctgctcaaggccacggagctcggtcjtgttcggcgtgtgggcgcacaatggcgaggtgcgca 
ccgccaggctgctgagcgagcgcgacgcggccaagcacaggctggtggtgctggtcaaggac 
aatggcgagcctccgcgctcggccagcgccacgctgcacgtgctcctggtggacggcttctc 
ccagccctacctgcctctcccggaggcggccccgacccaggcccaggccgacttgctcaccg 
tctacctggtggtggcgttggcctcg-gtgtcttcgctcttcctcttttcggtgctcctgttc 
gtggcggtgcggctgtgtaggaggagcagggcggcctcggtgggtcgctgcttggtgcccga 

GGGCCCCCTTCCAGGGCATCTTGTGCJACATGAGCGGCACCAGGACCCTATCCCAGAGCTACC 

agtatgaggtgtgtctggcaggaggctcagggaccaatgagttcaagttcctgaagccgatt 

ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 

taactttgggttcaatattcagtgaccatagttgacttttacattccataggtattttattt 

TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 

ttactcttgatttttctcatgttctttctccctttgttttaaagtgaacatttacctttatt 
cctggttctt 



FIGURE 163 



< / us r / s eqdb2 / s s t /DNA/ Dna seqs,min/ss. DNA4 8314 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pi: 4.84, NX(S/T): 5 

MEASGKLICRQRQVLFSFLLLGLSIjAGAAEPRSYSWEETEGSSFVTNLAKDLGLEQREFSR 
RGVRWSRGNKLHLQLNQETADLLLNEKLDREDLCGHTEPCVLRFQVLLESPFEFFQAELQV 
IDINDHSPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKR 
SDGRKYPELVLDKALDREEEAELRIiTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY 
RVQI SEDSPVGFLWKVSATDVDTGVNGE I S YSLFQASEE IGKTFKINPLTGE I ELKKQLDF 
EKIjQSYEWIEARDAGTFSGKCTVIilQVIDVMDHAPEVTMSAFTSPIPENAPETVVALFSVS 
DLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQ 
LIMTVLIADVNDNAPAFTQTSYTLFWENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP 
HLPLTSLVS INADNGHLFALRSLD-^EALQGFQFRVGASDHGSPALSS EALVRVWLDANDNS 
PPVLYPLQNGSAPCTELVPRAAEPGYLVTKWAVDGDSGQNAWLSYQLLKATELGLFGWAH 
NGEVRTARLLSERDAAKHRLVVTiVKDNGEPPRSATATLHVLLVDGFSQPYLPLPEAAPTQAQ 
ADLLTVYLWALASVSSLFLFSVLLFVAVRLCRRSRAASVGRCLVPEGPLPGHLVDMSGTRT 
LSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPGKEIQGNSTFPNNFGFNIQ 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 685-712 

Cadlierins extracellular respeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP- binding site motif A (P-loop) . 

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 



FIGURE 164 



acccacgcgtccgcccacgcgtccgcccacgcgtccgcccacgcgtccgcgcgtagccgtgc 
gccgattgcctctcggcctqgqc a;\tgg tcccggctgccggtcgacgaccgccccgcgtcat 
gcggctcctcggctggtggcaagtattgctgtgggtgctgggacttcccgtccgcggcgtgg 
aggttgcagaggaaagtggtcgcttatggtcagaggagcagcctgctcaccctctccaggtg 
ggggctgtgtacctgggtgaggaggagctcctgcatgacccgatgggccaggacagggcagc 
agaagaggccaatgcggtgctggggctggacacccaaggcgatcacatggtgatgctgtctg 
tgattcctggggaagctgaggaca2u^gtgagttcagagcctagcggcgtcacctgtggtgct 
ggaggagcggaggactcaaggtgc5acgtccgagagagccttttctctctggatggcgctgg 
agcacacttccctgacagagaagaggagtattacacagagccagaagtggcggaatctgacg 

CAGCCCCGACAGAGGACTCCAATA^lCACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG 
GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT 
TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 
GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 
CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 
TGTTCCTAATATTTTATTATTTCA2^.GGAGCTAAACCAATGGCCAGATTTAATCATACAGATC 
GAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTATAGAAGCCAAGAAGAAT 

gtggtggtaactcaagccgaccaai^.taggccctcttcccagcactttgataaaaagtgtgga 
ctggttgcttgtattttccttattctttttaattagttttattatgtatgctaccattcgaa 
ctgagagtattcggtggctaattccaggacaagagcaggaacatgtgga gtagt gatggtct 
gaaagaagttggaaagaggaacttcaatccttcgtttcagaaattagtgctacagtttcata 
cattttctccagtgacgtgttgacttga2\acttcaggcagattaaaagaatcatttgttgaa 
caactgaatgtataaaaaaattataaactggtgttttaactagtattgcaataagcaaatgc 
aaaaatattcaatag 



FIGURE 165 



X / us r / seqdb2 / s s t /DNA/Dna s eqs . min/ s s . DNA4 8 3 3 3 
xsiibunit 1 of 1, 360 aa, 1 stop 
xMW: 39885, pi: 4.79, NX(S/T): 7 

MVPAAGRRPPRVMRLLGWWQVLLWVLGLPVRGVEVAEESGRLWSEEQPimPLQVGAVYLGEE 
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRC 
NVRESLFSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLE 
NFTLKILNMSQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFLALDAS 
QHSSLSTRFGTVAVPNILLFQGAKPI^ARFNHTDRTLETLKIFIFNQTGIEAKKNVVVTQADQ 
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 321-340 

Homologous region to dilsufide isomerase 

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thioredoxin domain 

amino acids 211-227 
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CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCAC GATGC TGCAGGGCCCTGGCT 
CGCTGCTGCTGCTCTTCCTCGCCTCIGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 
TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGCA 
GCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 
TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 
GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 
CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 
CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 
TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 
TGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGTAAAAATG 
ATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATC 
CTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCTGAAGAA 
ATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACG 
CGCCCTATCTGGTCATGGGACAGAA?ICAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG 
TGGCAGAAGGGGCAGAGAGAGTTCA^GCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTA 
GTCCCGGCATCCTGATGGCTCCGAC21GGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC 
GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT 
TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 
GATAGCTGTTTTCACCTAAAGGAAA^iGCCCACCCGAATCTTGTAGAAATATTCAAACTAATA 
AAATCATGAATATTTTAA 



FIGURE 167 

></usr/ seqdb2/ sst/DNA/Dnasegs .min/ss .DNA50920 
xsxibiinit 1 of 1, 295 aa, 1 stop 
xMW: 33518, pi: 7.74, NX(S/T): 0 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPN 
LLGHETMKEVLEQAGAWIPLVMKQCHPDTPCKFLCSLFAPVCLDDLDETIQPCHSLCVQVKDR 
CAPVMSAFGFPWPDMLECDRFPQDNDLCIPLASSDHLLPATEEAPKVCEACKNKNDDDNDIM 
ETLCKtroFALKIKVTCEITYINRDTKIILETKSiCTIYKLNGVSERDLKKSVLWLKDSLQCTCE 
EMNDINAPYLVMGQKQGGELVITSVliCRWQKGQREFKRISRSIRKLQC 

Important features: 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus 

amino acids 6-153 



FIGURE 168 



gtggaggccgccgacgatggcggggccgacggaggccgagacggggttggccgagccccggg 
ccctgtgcgcgcagcggggccaccgcacctacgcgcgccgctgggtgttcctgctcgcgatc 
agcctgctcaactgctccaac6ccacgctgtggctcagctttgcacctgtggctgacgtcat 
tgctgaggacttggtcctgtccatggagcagatcaactggctgtcactggtctacctcgtgg 
tatccaccccatttggcgtggcggccatctggatcctggactccgtcgggctccgtgcggcg 
accatcctgggtgcgtggctgaactttgccgggagtgtgctacgcatggtgccctgcatggt 
tgttgggacccaaaacccatttgcgttcctcatgggtggccagagcctctgtgcccttgccc 
agagcctggtcatcttctctccagccaagctggctgccttgtggttcccagagcaccagcga 
gccacggccaacatgctcgccaccatgtcgaaccctctgggcgtccttgtggccaatgtgct 
gtcccctgtgctggtcaagaagggtgaggacattccgttaatgctcggtgtctataccatcc 
ctgctggcgtcgtctgcctgctgtccaccatctgcctgtgggagagtgtgccccccaccccg 
ccctctgccggggctgccagctccacctcagagaagttcctggatgggctcaagctgcagct 
catgtggaacaaggcctatgtcatcctggctgtgtgcttggggggaatgatcgggatctctg 
ccagcttctcagccctcctggagcagatcctctgtgcaagcggccactccagtgggttttcc 
ggcctctgtggcgctctcttcatcacgtttgggatcctgggggcactggctctcggccccta 
tgtggaccggaccaagcacttcactgaggccaccaagattggcctgtgcctgttctctct6g 
cctgcgtgccctttgccctggtgtcccagctgcagggacagacccttgccctggctgccacc 
tgctcgctgctcgggctgtttggcttctcggtgggccccgtggccatggagttggcggtcga 
gtgttccttccccgtgggggagggggctgccacaggcatgatctttgtgctggggcaggccg 
agggaatactcatcatgctggcaatgacggcactgactgtgcgacgctcggagccgtccttg 
tccacctgccagcagggggaggatccacttgactggacagtgtctctgctgctgatggccgg 
cctgtgcaccttcttcagctgcatcctggcggtcttcttccacaccccataccggcgcctgc 
aggccgagtctggggagcccccctcc::acccgtaacgccgtgggcggcgcagactcagggccg 
ggtgtggaccgagggggagcaggaagggctggggtcctggggcccagcacggcgactccgga 
gtgcacggcgaggggggcctcgctagaggaccccagagggcccgggagcccccacccagcct 
gccaccgagcgactccccgtgcgca21ggcccagcagccaccgacgcgccctcccgccccggc 
agactcgcaggcagggtccaagcgtccaggtttattgacccggctgggtctcactcctcctt 
ctcctccccgtgggtgatcac gtag ctgagcgccttgtagtccaggttgcccgccacatcga 
tggaggcgaactggaacatctggtccacctgcgggcgggggcgaaagggctccttgcgggct 
ccgggagcgaattacaagcgcgcacctgaaaa 



></usr/seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA50988 
xsubunit 1 of 1, 560 aa, 1 stop 
xMW: 58427, pi: 6.86, NX(S/T): 2 

MAGPTEAETGLAEPRALCAQRGHRl'YARRWVFLLAISLLNCSNATLWLSFAPVADVIAEDLV 
LSMEQINWLSLVYLWSTPFGVAAIWILDSVGLRAATILGAWLNFAGSVLRMVPCMWGTQN 
PFAFLMGGQSLCALAQSLVIFSPAKLAALWFPEHQRATANMLATMSNPLGVLVANVLSPVLV 
KKGEDIPLMLGVYTIPAGWCLLSTICLWESVPPTPPSAGAASSTSEKFLDGLKLQLMWNKA 
YVILAVCLGGMIGISASFSALLEQILCASGHSSGFSGLCGALFITFGILGALALGPYVDRTK 
HFTEATKIGLCLFSLACVPFALVSQ'LQGQTLALAATCSLLGLFGFSVGPVAMELAVECSFPV 
GEGAATGMI FVLGQAEGI LIMLAMTALTVRRSEPSLSTCQQGEDPLDWTVSLLLMAGLCTFF 
SCILAVFFHTPYRRLQAESGEPPSTRNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARG 
ASLEDPRGPGSPHPACHRATPRAQGP.AATDAPSRPGRLAGRVQASRFIDPAGSHSSFSSPWVIT 

Important features: 
Signal peptide: 

amino acids 1-44 

Transmembrane domains: 

amino acids 61-79, 98-112, 126-146, 169-182, 201-215, 248-268, 
280-300, 318-337, 341-357, 375-387, 420-441 

N-glycosylation site. 

amino acids 40-43 and 43-46 

Glycosaminoglycan attachme.nt site. 

amino acids 468-471 



FIGURE 170 



GTCCCACATCCTGCTC^y^CTGGGTCaGGTCCCTCTTAGACCAGCTCTTGTCCT.Ta^TTTGCTGA^ 

TAGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGGACaTCTCAGATCGCTTCCAATGAAGA 

TGGCCrTGCCTTGGGGTCCTGCTTGTTTCA.TAATa^TCTAACTATGGGACRAGGTTGTGCCGGa^GCTCTGGGG^ 

AAGQAGCACGGGGCTGATCAAGCCATCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACTCTAGTGGTT 

TCTGAATCTAGCCCACTTGGCGGTAAGCMSATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGG 

CTACTTATTTCTTTTAGGGGATTGTCAGGAGGTGACCACTCTCACGGTGAAATACCAAGTGTCAGAGGAAGTGCC 

ATCTGGTACAGTGATCGGGAAGCTGTCCCA'SGAACTGGGCCGGGAGGAQAGGCGGAGGCAAGCTGGGGCCGCCTT 

CCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCATTCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCG 

GCTGGATCGAGAGCAGCTGTGCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTT 

GGCTCTGATCCATGTGGAGATCCAAGTGCT.3QACATCaATGACCACCAGCCACGGTTTCCCAAAGGCGAGCAGGA 

GCTGGAAATCTCTGAGAGCGCCTCTCTGCGftACCCGQATCCCCCTGGACAGAGCTCTTGACCCAGACACAGGCCC 

TAACACCCTGCACACCTACACTCTGTCTCOCAGTQAGCACTTTGCCTTGGATGTCATTGTGGGCCCTGATGAGAC 

CayUlCATGCaGAACTCATAGTGGTGAAGGAGCTGGACAGGGAAATCCffiLTTCaTTTTTTGATCrGGTGTTAACTG 

CTATGACAATGGGAACCCCCCCAAGTCAGGrACCAGCTTGGTCAAGGTCAACGTCTTGGACTCCAATGACAATAG 

CCCTGCGTTTGCTGAGAGTTCACTGGCACTGGAAATCO^GAAGATGCTGCaCCTGGTACGCTTCTCATAAS^ 

GACCGCCaCAGACCOTGACCAAGGCCCCAATGGGQAGGTGGAGTTCTTCCTCAGTAAGCACATGCerC^ 

GCTGGACACCTTCAGTATTGATGCCAAGAa^GGCCAGGTCATTCTGCGTCQACCTCTAGACTATGAAAAGAACCC 

TGCCTACGAGGTGGATGTTC:AGG<:aAGGQACCTGGGTCCC:ffiATCCTATCCO^GCCCa.TTO 

GGTTCTGGATGTCAATGACAACATCCCAAGCATCCACGTCACATGGGCCTCCaiGCCATCACTGGTGTC^ 

TCTTCCCAAGGACAGTTTTATTGCTCTTGTi^ATGGCAGATGACTTGGATTCAGGACSlCaATGGTTTG^^ 

CTGGCTGAGCCAAGAGCTGGGCCACTTCaGGCTGAaAAGAACTAATGGCAACaCATACS^TGTTGCTAACCAAT^ 

CACACTGGACAGAGAGO^GTGGCCCaAATATACCCTCACTCTGTTAGCCCAAGACaVAGGACTCCAGC 

AGCCAAGAAA(^GCTCAGCATTCAGATCaGTGACATCAACGACAATGCACCTGTGTTTGAGAA2VAGCAGGTATGA 

AGTCTCCACGCGGGA?^<::aACTTACCCTCTCTTCaCCTa^TTACCATCaAGGCTCATGATGa^GACT^ 

TAATGGAAAAGTCTCATACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGACTCCAACACAGGAGA 

GGTCACTGCTCAGAGGTCACTGAACTATGAJIGAGATGGCCGGCTTTGAGTTCCAGGTGATCGCAGAGGACAGCGG 

GCAACCCATGCTTGCATCCAGTGTCTCTGTGTGGGTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGT 

CCAGCCTGTGCTCAGCGATGGAAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCC 

Ca.TCGAGACTCCCAATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTC:AC:a 

CCTTTTGACaACCATTGTGGCAAGAGATGQVGACTCGGGGGCAAATGGAGAGCCCCTCTACAGCATCCGCAATGG 

AAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCAGCTGTTCGTCAATGTCACa^TGCCAGCAGCCT 

CATTGGGAGTGAGTGGGAGCTGGAQATAGTAGTAGAGGACCAGGGAAGCCCCCCCTTACAGACCCGAGCCCTGTT 

GAGGGTOITGTTTGTCACCAGTGTGGACCACJCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGAT 

gctgacggtgatctgcctggctgtactgttgggcatcttcgggttgatcctggctttgttcatgtccatctgccg 
gacagaaaagaaggacaacagggcctacaac:tgtcgggaggccgagtccacctaccgccagcagcccaagaggcc 

CCAGAAACACATTCAGAAGGCAGACATCCAGCTCGTGCCTGTGCTCAGGGGTCAGGCAGGTGAGCCTTGTGAAGT 

CGGGCAGTCCCACAAAGATGTGGACAAGGAGK3CGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTT 

CCACCTCACCCCGACCCTGTACAGGACGCTGfCGTAATCAAGGCyUiCCAGGGAGCACCGGCGGAGAGCCGAGAGGT 

GCTGCmGACACGGTCAACCTCCTTTTCAAC:CATCCCAGGCAGAGGAATGCCTCCCGGGAGAACCTGAACCTTCC 

CGAGCCCCAGCCTGCO^CAGGCCAGCCACGTTCCAGGCCTCTGAAGGTTGCAGGCAGCCCCACAGGGAGGCTGGC 

TGGAGACCAGGGCAGTGAGGAAGCCCCACACJAGGCCACakGCCTCCTCTGCAACCCTGAGACGGCAGCGACATOT 

CmTGGC:AAAGTGTCCCCrGAGAAAGAATCa.GGGCCCCGTCaVGATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGC 

TGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGCT 

GTCCTTGCTGCATCAGGGCCAATTCCAGCCCAAACCAAACCACCGAGQAAATAAGTACTTGGCCAAGCCAGGAGG 

CaGC:AGGAGTGCaATCCCAQACACAGATGGC!CCAAGTGCAAGGGCTGGAGGCCAGACAGACCCAGaAC^ 

AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCC 

O^GCaCAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCQAGACTCTCTTTGCCCCTCAC 

CACCAACTACCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGTTCGG 

CAAGGa^QAGGG?lCCAGAGCTGAGCCa^Ca.GGCACGAGGCTGGCCAGCACCTTTGTCTCGGAQATGAGC 

GCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCTCTCGGT 

CTGCGGGAGGACCCTCAGTTTAGACTTGGCCACC:AGTGCa.GCCTCAGGCATGAAAGTGC:AAGGGGACCCAGGT^ 

AAAGACGGGGACTGAGGGCAAGAGCAGAGGCAGCAGCaGCaGCAGCAGGTGCCTGTGAACATACOT 

CTGGATCCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCACTAGCTAG 

CGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGOUiGAGCCCCAGGACTAACAGCTGAC 

TGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGAGGACAGGGACGGTTTGTGGCTGAGATAAGTGTT 

TCCTGGCAAAACATATGTGGAGCACAAAGGGTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACAGGCAGG 

AAAGGGTGGCCTTCTTGGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTAT 

CaATAAAGGAAAAGCAGTAAAAAaAAAAAAA?^AAAAAA 



FIGURE 171 



</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48331 
oubunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX(S/T): 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA 
GAAFQVLQLPQALPIQVDSEEGIiLSTGRRLDREQLCRQWDPCLVSFDVIATGDLALIHVEIQ 
VLDINDHQPRFPKGEQELEI SESAS LRTRI PLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 
GPDETKHAELIWKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESS 
LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPL 
DYEKNPAYEVDVQARDLGPNPI PAHCKVLI KVLDVNDNI PS IHVTWASQPSLVSEALPKDSF 
lALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 
QGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 
YRIQDSPVAHLVAIDSNTGEWAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA 
NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT 
IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIWEDQGS 
PPLQTRALLRVMFVTSVDHLRDSARICPGALSMSMLTVICLAVLLGIFGLIIALFMSICRTEK 
KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEA 
GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 
QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 
ILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS 
RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPD 
PAWMARLSLPLTTNYRDNVISPDAAi^TEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSL 

LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCL 

Important features: 
Signal peptide: 
amino acids 1-13 
Transmembrane domain: 
amino acids 719-739 
N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 
Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 



FIGURE 172 

CGGACGCGTGGGCGGACGCGTGGGG(GAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 
CAGACCGTGTGAGGGGGCCTGTGGGCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 
GCAGGAGCCTTCCTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCT 
CCCAGATACTATTTTTTGGATTTGGtGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 
GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 
GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 
GGAAAATGAACCTGTGTGTAATTC']?GCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 
TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 
ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 
AACATGGGATCTTATCCATAGAACiltGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 
ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 
CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 
TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 
GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 
AAGTGAAAATCTTACTCTTATTCA2^.CAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 
TTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 
AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 
CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 
TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT 
TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 
CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 
AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 
TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 
TGATGTGATCTTCCTG6TCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 
AGGCACCAGAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGT 
GGTTTCAAAATTTAGATATAAGAGG(3GGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC 
AAACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 
CTATGACCATGAGTAGCATCAGCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA 
GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 
AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 
AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 
AAAATCAGAGACTGTAACAAAAAAAZiAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



FIGURE 173 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEI 
LGVLNSSSRYFHWKMSTLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFF 
WKLGDPFPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDI 
I^LERRLLQTMDMIISKKKRMAMAEIRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQ 
QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFS lYCVWKI FMATINIVF 
DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGI I IVTSIRGLLITLTKFFYAISS 
SKSSNVIVLLLAQIMGMYFVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSA 
LSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



FIGURE 174 



CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
TTACATTGGCTATTTTATTGTGAGCJVATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 



FIGURE 175 



GTGTTGCCCTTGGGGAGGGGAAGGG(3AGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC 
TTTNTTGAATTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 
TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 
NTCCGTGACGTTTGCATTTTCTTGGA.CCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 
TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 
CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 
TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 
TAGGAGATCCCTTTCCCATTCTC 



CTCGCGCAGGGATCGTCCCATGGCCGGGGCrCGGAGCCGCGACCCTTGGGGGGCCTCCGGGATTTGCTACCTTTT 

TGGCTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAATCTGGACGTGATGGGTGCCTTGCGCAA 

GGAGGGCGAGCCAGGCaGCCTCTTCGGCTTCTCTGTGGCCCTGCACCGGCAGTTGCAGCCCCGACCCCAGAGCTG 

GCTGCTGGTGGGTGCTCCCCAGGCCCTGGCrCTTCCTGGGCAGCAGGCGAATCGCACTGGAGGCCTCTTCGCTTG 

CCCGTTGAGCCTGGAGGAGACTGACTGCTA'CaiGAGTGGACaTCGACCAGGGAGCTGATATGCAAAAGGAAAGC^ 

GQAGAACCAGTGGTTGGGAGTCAGTGTTCG3AGCCAGGGGCCTGGGGGCAAGATTGTTACCTGTGCACACCGATA 

TGAGGCAAGGCAGCGAGTGGACCAGATCCT(3QAGACGCGGGATATGATTGGTCGCTGCTTTGTGCTCAGCCAGGA 

CCTGGCCATCCGGGATGAGTTGGATGGTGGGGAATGGAAGTTCTGTGAGGGACGCCCCCAAGGCCATGAACAATT 

TGGGTTCTGCCAGCAGGGCACAGCTGCCGCCTTCTCCCCTGATAGCCACTACCTCCTCTTTGGGGCCCCAGGAAC 

CTATAATTGGAAGGGCa.CGGCCAGG6TGGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGACGACGG 

TCCCTACGAGGCGGGGGGAGAGAAGGAGCA(3GACCCCCGCCTCATCCCGGTCCCTGCCAACAGCTACTTTGGCTT 

CTCTATTGACTCGGGGAAAGGTCTGGTGCGTGCAGAAGAGCTGAGCTTTGTGGCTGGAGCCCCCCGCGCCAACCA 

CAAGGGTGCTGTGGTCATCCTGCGCAAGGA.CAGCGCCAGTCGCCTGGTGCCCGAGGTTATGCTGTCTGGGGAGCG 

CCTGACCTCCGGCTTTGGCTACTCACTGGCTGTGGOTGACCTCAACAGTGATGGCTGGCCAGACCTGATAGTGGG 

TGCCCCCTACTTCTTTGAGCGCCAAGAAGAGCTGGGGGGTGCTGTGTATGTGTACTTGAACCAGGGGGGTCACTG 

GGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCATGTTCGGGATCAGCCTGGCTGTCCTGGGGGA 

CCTCAACCAAGATGGCTTTCCAGATATTGaVGTGGGTGCCCCCTTTGATGGTGATGGGAAAGTCTTCATCTACCA 

TGGGAGCAGCCTGGGGGTTGTCGCCaAACCTTCACAGGTGCTGGAGGGCGAGGCTGTGGGCATCAAGAGCTTCGG 

CTACTCCCTGTCAGGCAGCTTGGATATGQATGGGAACCAATACCCTGACCTGCTGGT6GGCTCCCTGGCTGACAC 

CGCaGTGCTCTTOlGGGCCAGACCCATCCTCCATGTCTCCCATGAGGTCTCTATTGOTCCACGAAGCATCQACCT 

GGAGCAGCCCAACTGTGCTGGCGGCCACTCCK3TCTGTGTGGACCTAAGGGTCTGTTTCAGCTACATTGCAGTCCC 

CAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTGTTAGATGCGQACACAGACCGGAGGCTCCGGGGCCAGGT 

TCCCCGTGTGACGTTCCTGAGCCGTAACCTCKSAAGAACCCAAGCACCAGGCCTCGGGCACCGTGTGGCTGAAGCA 

CCAGCATGACCGAGTCTGTGGAGACGCCATGTTCCAGCTCCAGGAAAATGTCaAAGACAAGCTTCGGGCCATTGT 

AGTGACCTTGTCCTACAGTCTCCAGACCCCTCGGCTCCGGCGACAGGCTCCTGGCCAGGGGCTGCCTCCAGTGGC 

CCCCATCCTCAATGCCCACCAGCCCAGCACCCAGCGGGCAGAGATCCACTTCCTGAAGCAAGGCTGTGGTGAAGA 

CAAGATCTGCCAGAGCAATCTGCAGCTGGTCCAOSCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTCCAACC 

TCTGCCCAT6GATGTGGATGGAACAACAGCCCTGTTTGCACTGAGTGGGCAGCCAGTCATTGGCCTGGAGCTGAT 

GGTCa.CCAACCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGATGATGCCCATQAAGCCCa^ 

CaTGCTTCCTGACTCaCTGCACTACTCAGGCX3TCCGGGCCCTGGACCCTGCGGAGAAGCCACTCTGCCTGTCCAA 

TGAGAATGCCTCCCATGTTGAGTGTGAGCTGGGQAACCCCATGAAGAGAGGTGCCCAGGTCACCTTCTACCTCAT 

CCTTAGCACCTCCGGQATCAGCATTGAGACCIACGQAACTGGAGGTAGAGCTGCTGTTGGCCACQATCAGTQAGCA 

GGAGCTGCS^TCCAGTCTCTGOlCGAGCCCGTGTCTTCATTGAGCTGCCaCTGTCCATTGCAGGAATGGCCATrc^ 

CCAGCAACTCTTCTTCTCTGGTGTGGTGAGGiGGCGAGAQAGCCATGCAGTCTGAGCGGGATGTGGGCAGC^ 

a^GTATGAGGTCACGGTTTCCAACCaAGGC:CAGTCGCTCAGAACCCTGGGCTCTGCCTTCCTCAACATCATC^ 

GCCTCATGAGATTGCCAATGGGAAGTGGTTGrcTGTACCCAATGCAGGTTGAGCTGGAGGGCGGGCAGGGGCCTGG 

SS^^^^'^^^^'^^^^^^^^^^'^^^^^^^CTGGATGTGGACAGTAGGGATAGGAGGC^^ 

GCTGGAGCCACCTGAGCAGCAGGAGCCTGGTGAGCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGC 

^^^S^^^^^^^^*^^^^^^^®^^^^^'^^CC<3®3GCACGGCC^ 

CAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGCCGTCTCTGGAACAGCACCTTTCTGGAGGAGTACTCAGC 

^^^^^S'^^^^T^^G'TGATTGTCCGGGCCAACATCACaGTGAAGTCCTCCATAAAGAACTTGATC^ 

^^S^^^^^^^^^^^^^^^^'^^'^^'^^^'T'T^CCCCATGGCTGTGGTGGC^ 

CAAACGGGCGAAGCACCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAAGATTCCTCGGGAAGACCGACAG^ 

GTTCaAGGAGQAGAAGACGGGCACCATCCTGAGGAACAACTGGGGCAGCCCCCGGCGGGAGGGCCCGGATGCACA 

CCCCATCCTGGCTGCTQACGGGCATCCCGAGCTGGGCCCCGATGGGCATCCAGGGCCAGGCACCGCCTAG^^ 

S5^^^^SE^S^^^^^^"^°^CTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGATG^ 

S??S^^9T^^°^CG^TCAAGATTTGGa^GGATCGGCTTCCTCAGGGGCACAGACCTCTCC^^ 

TCCTCCOICCCAACTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGACAGGGCC^^ 

TGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGATCCTAATCCCTTCCTCTCCCATTCACCCTGT 

GTAACAGGACCCCAAGGACCTGCCTCCCCGGAAGTGCCTTAACCTAGAGGGTCGGGGAGQAGGTTGTGTC^^ 

S™^^^°^'^'^^^^'^^^^^'^"^^^^^^^TCATCTGACCTTAGTTTGCTGC 

TTCGTCTATTTATTAAAAAATATTTGAGAAGfiAAAAAAAAAAAAAAAAAAA 



X /us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 5 7 3 7 
xsubunit 1 of 1, 1141 aa, 1 stop 
xMW: 124671, pi: 5.82, NX{S/D : 5 

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQL 
QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGADMQKESKENQWL 
GVSVRSQGPGGKIVTCAHRYEARQR'TDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEG 
RPQGHEQFGFCQQGTAAAFSPDSHYLLFGAPGTYNWKGTARVELCAQGSADIiAHLDDGPYEA 
GGEKEQDPRLIPVPANSYFGFSIDSGKGLVRAEELSFVAGAPRANHKGAWILRKDSASRLV 
PEVMLSGERLTSGFGYSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVYVYLNQGGHWAGI 
SPLRLCGSPDSMFGISLAVLGDLNQDGFPDIAVGAPFDGDGKVFIYHGSSLGWAKPSQVLE 
GEAVGIKSFGYSLSGSLDMDGNQYP.DLLVGSLADTAVLFRARPILHVSHEVSIAPRSIDLEQ 
PNCAGGHSVCVDLRVCFSYIAVPSSYSPTVALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK 
HQASGTVWLKHQHDRVCGDAMFQLQENVKDKLRAIWTLSYSLQTPRLRRQAPGQGLPPVAP 
ILNAHQPSTQRAEIHFLKQGCGEDKICQSNLQLVHARFCTRVSDTEFQPLPMDVDGTTALFA 
LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN 
ENASHVECELGNPMKRGAQVTFYLILSTSGISIETTELEVELLLATISEQELHPVSARARVF 
lELPLS lAGMAI PQQLFFSGWRGEl^AMQSERDVGSKVKYEVTVSNQGQSLRTLGSAFLNIM 
WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE 
RQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAAVLHVWGRLWNSTFLEEY 
SAVKSLEVIVRANITVKSSIKNLMKRDASTVIPVMVYLDPMAVVAEGVPWWVILLAVLAGLL 
VLALLVLLLWKMGFFKRAKHPEATVPQYHAVKI PREDRQQFKEEKTGTI LRfcJNWGSPRREGP 
DAHPIL2iADGHPELGPDGHPGPGTA 



Important features: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 1040-1062 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 



FIGURE 178 



CGCGCCGGGCGCAGGGAGCTGAGTGKSACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 
AAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCT 
CACAACAAGATGCTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 
GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 

tggatgataaacaatggctcaccac:aatctctcagtatgacaaggaagtcggacagtggaac 
aaattccgagacgaagtagaggatgattatttccgcacttggagtccaggaaaacccttcga 
tcaggctttagatccagctaaggatccatgcttaaagatgaaatgtagtcgccataaagtat 

GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 

aaagaagcaggagtagaccataggcagtggaggggtcccatattatccacctgcaagcagtg 

CCCAGTGGTCTATCCCAGCCCTGTITGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 

aactagaatatcaggcatgtgtcttaggaaaacagatctcagtcaaatgtgaaggacattgc 

CCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCT 

ggagttcagggaagtggcaaacagattgcgggactggttcaaggcccttcatgaaagtggaa 
gtcaaaacaagaagacaaaaacattgctgaggcctgagagaagcagattcgataccagcatc 
ttgccaatttgcaaggactcacttggctggatgtttaacagacttgatacaaactatgacct 
gctattggaccagtcagagctcagaagcatttaccttgataagaatgaacagtgtaccaagg 
cattcttcaattcttgtgacacatacaaggacagtttaatatctaataatgagtggtgctac 

TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 

aggggtaaagaagctcctaggacagtatatccccctgtgtgatgaagatggttactacaagc 

CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 
ATGGGATCCAGAATAAATGGTGTTGC^AGATTGTGCTATAGATTTTGAGATCTCCGGAGATTT 
TGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATG 

atgaagatgaaattgaagatgatgatgaagatgaaggggatgatgatgatggtggtgatgac 

CATGATGTATACATTTGATTGATGAiZAGTTGAAATCAATAAATTCTACATTTCTAATATTTA 

caaaaatgatagcctatttaaaattatcttcttccccaataacaaaatgattctaaacctca 

CATATATTTTGTATAATTATTTGAAJiAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAAT 

aagaatcatttgctttgagtttttatattccttacacaaaaagaaaatacatatgcagtcta 

GTCAGACAAAATAAAGTTTTGAAGTCJCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 

aaatcttccataagcaaaatgacagctagtgcttgggatcgtacatgttaattttttgaaag 
ataattctaagtgaaatttaaaatajyitaaatttttaatgacctgggtcttaaggatttagg 
aaaaatatgcatgctttaattgcatttccaaagtagcatcttgctagacctagatgagtcag 

GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 



FIGURE 179 



>< /usr / s eqdb2 / s s t /DNA/Dnas eqs . min/ s s . DNA4 9829 
xsubunit 1 of 1, 436 aa, 1 stop 
xMW: 49429, pi: 4.80, NX(S/T): 0 

MLKVS AVLCVCAAAWCSQSLAAAAA VAAAGGRSDGGNPLDDKQWLTT I SQYDKEVGQWNKFR 
DEVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKYCIAQDSQTAVCISHRRLTHRMKEA 
GVDHRQWRGPILSTCKQCPWYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
SDKPTSTSRNVKRACSDLEFREVANRIiRDWFKALHESGSQNKKTKTLLRPERSRFDTSILPI 
CKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQ 
RQQDPPCQTELSNI QKRQGVKKLLGQYI PLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAIDFEISGDFASGDFHEWTODEDDEDDIMJSTDEDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern. 

amino acids 246-267 

N-myristoylation sites, 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeiit proteins 

amino acids 353-365 and 339-352 



FIGURE 180 



CAGACTCCAGATTTCCCTGTC^^CCACGSiGGiAGTCCaGAGAGGAAACGCGGAGCGGAGACaACAGTACCT 
CTCTTTCAGCCCGGGATCGCCCCAGCAGGGATGGGCGACaAGATCTGGCTGCCCTO 

GCTCTGCCTCCGGTGCTGCTGCCTGGGGCGCJCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTT 

CCCGCCGGCCAGAAGGAGTGCTTCTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTA 

GATGGAGCAGGATTAGATAITGATTTCCATCITTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGA 

TCAGATGGAGTTCACACTGTAGAGACTGAAGITTGGTGATTACaTGTTCTGCTTTGACmATAC:a.TTC:aGCACC^^ 

TCTGAGAAGGTGATTTTCTTTGAATTAATCCITGGATAATATGGGAGAACaGGCACAAGAACAAGAAGATTGGAAG 

AAATATATTACTGGCACAGATATATTGGATATGAAACTGGAAGACmTCCTGGRATCCATCAACAGCATC^ 

AGACTAAGCAAAAGTGGGCACATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAAGRTAC^ 

AACTTTGATAGAGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAAGTTTAT 

ATGCTGAAGAGTCTGTTTGAAGATAAGAGGSAAAGTAGAACTTM^^CTCCAAACTAGAGTACGTAACATTGAAA 

AATGAGGCATAAAAATGCaATAAACTGTTACAGTCy^GACCATTAATGGTCTTCTCCAAAATATTTTG^ 

AAAGTAGGAAACAGGTATAATTTTAATGTGJAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAG 

TTGTACTTAAGTGTGTAACAGGAATATTTTGCAGAATATAGGTTTAACTGAAT6AAGCCATATTAATAACTGCAT 

TTTCCTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTAA'TTGCAACACC 

AGTCTGTTTTTAACAGGTTCTATTACCCAGftACTTTTTTGTAAATGCGGCAGTTACAAATTAACTGTGGAAGTTT 

TCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGATGGATTGAATAAATCTTTAQACTACAAAAGCCCAA 

CTTTTCTCTATTTACATATGCATCTCTCCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTG 

AGATTTTTATAACCA^TACATTTCAGTGTJiACATATTAGCAGAAAGCATTAGTCTTTGTACTTTGCTTACATTC 

CCAAAAGCTGACATTTTCaCGATTCTTAAiyiACACa^GTTACACTTACTAAAATTAGGACATG 

AAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTCTCTAACCCTTAACTAAAGTGTAGGA 

TTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGTATCTGTCAAGTTAATATCTGTCAACAGTTAA 

TAATCATGTTATGTTAATTTTAACATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATGAAGGAAATA 

TTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGGAA 

AGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTATAAATAAATATCTAGA 

ATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGGAGATGATTAATTTCCAGTTAGCTGGAAG 

AAACTTTGGCTGTAGGTTTTTATTTTCTACAAGAATTCTGGTTTQAATTATTTTTGTAAGCAGGTACATTTTATA 

AAATGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTAT 

TAAAATGGCCTTTCTGAACACTTTATTTATTQATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTTTTAAA 

CACCTAAATGTGAATAACCCATATATACAACAAAGTTTCTGCCATCTAGCTTTTTGAAGTCTATGGGGGTOTT^^ 

TCAAGTACTAGTAATTTAACTTCATCATGAATGAACTATAATTTTTAAGTTATGCCCATTTATAACGTTGTTTAT 

GACTACATTGTGAGTTAGAAACAAACTTAAAATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATT 

CTTGATGAGCAATAATGATAACCAGAGAGTG?ITTTCATTTACACTCATAGTAGTATAAAAAGAGATACATTTCCC 

TCTTAGGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAATGCCGTAT 

ATGATCAATTACCTTAATTGGCCAAGAAAATGCTTO^GGTGTCTAGGGGTATCCTCTGCAACACrTGCAGAACAA 

AGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCAAATTTACA 

GTACCATAACTAATAAAGCAGGGTACAGATATAAACTACTGCATCTTTTCTATAAAACTGTQATTAAGAATTCTA 

CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCT 

ACATGTATGATTTGTGCCACTGATOTTAAAOCTATGATTCAGTAACTTCTTACCATATAAAAACGATAATTGCTT 

TATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAAGTAAAAAQACAGATATTTAAGAGG 

CaiTAACCAAAAAAGCAAAACTTGTAAACAGAGTAAAAATCTTTAATATTTCTAAAGACATACTGTTTATCTGCTT 

CATATGCTTTTTTTAATTTCACTATTCCATTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTTTATCa^CTT 

AACAGCTCATTTTGTCTTTTTCAATATACAAii^TTTTAAAAATACTACyy^TATTTAACTAAG^^ 

CATAATGTAGCAGTTACCGTGTTCACCTCAai^CTAAGGCCTAGAGTTTGCTCTGATATGCATTTGGATGATTAAT 

GTTATGCTGTTCTTTCATGTGAATGTCAAGA'CATGGAGGGTGTTTGTAATTTTATGGTAAAATTAATCCTTCTTA 

CACATAATGGTGTCTTAAAATTGACAAAAAATGAGCACTTACAATTGTATGTCTCCTCAAATGAAGATTCTTTAT 

GTGAAATTTTAAAAGAOlTTGATTCCGCaTGTAAGGATTTTTCATCTGAAGTACAATAATGCACAATCAGTGTTG 

CTCAAACTGCTTTATACTTATAAACAGCCaTCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATAATAA 

AAATTATCAAAGGAAAA 



FIGURE 181 

></usr/ seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA52196 
xsubunit 1 of 1, 229 aa, 1 stop 
xMW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLAALPPVLLPGA AGFTPS LDSDFTFTLPAGQKECFYQPMPLKAS LE I EY 
QVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFEL 
ILDmGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRN 
IQESNFDRVNFWSMVNLWMVWSAIQVYMLKSLFEDKRKSRT 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site. 

amino acids 55-62 



FIGURE 182 

CCATCCCTGAGATCTTTTTATAAAAJI^CCCAGTCTTTGCTGACCAGACAAAGCATACCAGAT 
CTCACCAGAGAGTCGCAGACACTATGCTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 
TAACAGCTACTCATACATCTGGATTGGGCTCCATGACCCCACACAGGGCTCTGAGCCTGATG 
GAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGAC TAGG GCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGC<3TGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTGAGATC 
TCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 183 

< /us r / seqdb2 / s s t /DNA/ Dnasesqs . min/ s s . DNA5 6965 
<subunit 1 of 1, 175 aa^ 1 stop 
<MW: 19330, pi: 7.25, NX (S/T) : 1 

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRPSGKLVSVLSGAEGSF7SSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVlVDSryFAWEKNPSTIlJSrPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 

amino acids 1-26 



C-type lectin domain signature. 

amino acids 146-171 



FIGURE 184 



CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGAGC 

acagaggagtgggccgggaccatgc'sggggacgcggctggcgctcctggcgctggtgctggc 
tgcctgcggagagctggcgccggccctgcgctgctacgtctgtccggagcccacaggagtgt 
cggactgtgtcaccatcgccacctg(:::accaccaacgaaaccatgtgcaagaccacactctac 
tcccgggagatagtgtaccccttccagggggactccacggtgaccaagtcctgtgccagcaa 

GTGTAAGCCCTCGGATGTGGATGGCJLTCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 
CTCCCACTCTTGAGCCTCCGACTGT/^AGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 
GCCCCGAATGCCTTGAAGAAGTGCCC:CCTGCACCAGGAAAAAAAAAAAAAAAAA 



HPTT! 



FIGURE 18 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56405 
<subunit 1 of 1, 125 aa, 1 stop 
<MW: 13115, pi: 5.90, NX{S/T): 1 

MRGTRLALIjy^VLAACGELAPALRCWCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCOTELCirVDGAPALNSLHCGALTLLPLLSLRL 

Important features: 
Signal peptide: 

amino acids 1-17 



N-glycosylation site. 

amino acids 46-49 



FIGURE 1 



CTGCAGTCAGGACTCTGGGACCGCACSGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAGATGATGGCTCTGC3GCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 

tgcagtcagcgccgcgccgggaatcc:tgtacccgggcgggaataagtaccagaccattgaca 

ACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTA6T 

cccacccgcggaggggacgcaggcgtgcaaatctgtctcgcctgcaggaagcgccgaaaacg 
ctgcatgcgtcacgctatgtgctgccccgggaattactgcaaaaatggaatatgtgtgtctt 
ctgatcaaaatcatttccgaggagai\attgaggaaaccatcactgaaagctttggtaatgat 
catagcaccttggatgggtattccagaagaaccaccttgtcttcaaaaatgtatcacaccaa 
aggacaagaaggttctgtttgtctccggtcatcagactgtgcctcaggattgtgttgtgcta 

GACACTTCTGGTCCAAGATCTGTAAJiCCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAGAAAAGGCTCTCATGGACTAGi^AATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTC 
TTGCCGGATACAGAAAGATCACCATGAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACC 
TTTTATGACCTTCATCAACTCAATCGTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGJAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTiJLAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAJkCCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCA 
AGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATT 
TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAAAATTTAACATTTAAAAAAAAAAAAA 



FIGURE 187 

</usr/seqdb2/sst/DNA/Dnasegs .rain/ss .DNA57530 
<subunit 1 of 1, 266 aa, 1 stop 
<MW: 28672, pi: 8.85, NX(S/T): 1 

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSA 
APGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRKRCMRH 
AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 
SVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQ 
KDHHQASNSSRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn{2)-Cys(6) binuclear cluster domain 

amino acids 110-126 



FIGURE 188 

TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 
ACGGGAAGGTTTTGTGGGACCCAGGTTGAAATGACGGTCATTTTTTTTTCTTTCTCCTTCNG 
GAGTCCTTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 
GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGG 



FIGURE 189 

GAGGAACCTACCGGTACCGGCCGCGCGCTGC3TAGTCGCCGGTGTGGCTGCACCTCACCAATCCCGTGCGCCGCGG 

CTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCA 

GGGTTTGAGGATGGGGGAGTAGCTACAGGAi^GCGACCCCGCGATGGCa^GGTATATTTTTGTGGAATGAI^GGA 

AGTATTAGAAATGAGCTGAAGACCATTCAa\GATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCT 

TGAAGTAATGTAGACAGAAGTTCTCT^TTTGCATATTACATCAACTGGAACCAGCAGTGAATCTTAATGTTCAC 

TTAAATCAGAACTTGCATAAGAAAGAGAATCiGGAGTCTGGTTAAATAAAGATGACTATATCAGAQACrTGAAAA 

GATCATTCTCTGTTTTCTGATAGTGTATATCK3CCATTTTAGTGGGCACAGATCAGGATTTTTACAGTTTACTTGG 

AGTGTCCAAAACTGCAAGCAGTAGAGAAATiWlGACAAGCTTTCAAGAAATTGGCATTGAAGTTACATCCT^ 

AAACCCGAATAACCCAAATGCACATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGA 

TCTACGGAAAAAGTATGACAAATATGGAGA^kAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAGCTGGAA 

CTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGGAAAGAAGAGAATTTGATGC 

TGCTGTTAATTCTGGAGAACTGTGGTTTGTilAATTTTTACTCCCCAGGCTGTTCACACTGCCATGATTTAGCTCC 

CACATGGAGAGACTTTGCTAAAGAAGTGGATGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAAT 

GCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATA 

TCS^TGGAGACAGATCAAAGGAGAGTTTAGTGAGTTTTGCaATGCAGCATGTTAGAAGTACaGTGACAG^ 

GACAGGAAATTTTGTCAACTCCATACAAACTGCTTTTGCTGCTGGTATTGGCTGGCTGATCACTTTTTGTTCSy^ 

AGGAGGAGATTGTTTGACTTCACAGACACG?^CTCAGGCTTAGTGGCa.TGTTGTTTOTCAACTCATTGGATGCTAA 

AGAAATATATTTGGAAGTAATACATAATCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTT 

GGCTCaTCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGA?UU^TT(:ayiATGATCCTGAGCTGAAAAAACT 

AAAAACTCTACTTAAAAATGATCATATTCa^lGTTGGCAGGTTTGACTGTTCCTCTGCACCAGACATCTGTAGTAA 

TCTGTATGTTTTTCAGCCGTCTCTAGCaGTATTTAAAGGACAAGGAACCAAAGAATATGAAATTCATCATGGA^ 

GAAGATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAA 

TTTTCCTGCCSU^TGACAAAGAACCATGGCTTGTTGATTTCTITGCCCCCTGGTGTCCACCATGTCGAGCTTTACT 

ACCAGAGTTACGAAGAGCATCAAATCTTCTITTATGGTCSIGCTTAAGTTTGGTACACTAGATTGTACAGTTCATG^ 

GGGACrCTGTAACATGTATAACATTCaGGCTTATCG?^CAACAGTGGTATTCaACCAGTCCAACAT^ 

TGAAGGACATCACrCTGCTGAACAAATCTTGIGAGTTCATAGAGGATCTTATGAATCCTTCAGTGGTCTCCCTTAC 

ACCCaCO^CCTTCAACGAACTAGTTACAa^aAGAAAACAa^CGAAGTCTGGATGGTTGATTTCTATTCTCCGTG 

GTGTCATCCTTGCCaAGTCTTAATGCCaLGAa.TGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGG 

C^GTATAGATTGCCa^CAGTATCATTCTTTlTGTGCCCAGGAAAACGTTCAAAGATACCCTGAGATAAGATTTTO 

TCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTAa^TGGTTGGAATAGGGATGCTTATTCCCTGAGaAT 

CTGGGGTCTAGGATTTTTACCTCAAGTATCCACAGATCTAACACCTCAGACTTTCAGTGAAAAAGTTCTACAAGG 

GAAAAATCATTGGGTGATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAQAATTTGAGCT 

CTTGGCTAGGATGATTAAAGGAAAAGTGAAfl.GCTGGAAAAGTAGACTGTCa.GGCTTATGCTCaGACATGCCAGAA 

AGCTGGGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGAAAGAGCS^GAGAAATTTTCAAGAAGAGCA 

GATAAATACCy^GAGATGCaAAAGCAATCGCT'GCCTTAATAAGTGAAAAATTGGAAACTCTCCGAAATCAAGGC^ 

GAGGAATAAGGATGAACTTT6ATAATGTTGAAGATGAAGAAAAAGTTTAAAAGAAATTCTGACAGATGACATCAG 

AAGACACCTATTTAGAATGTTACATTTATGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCA 

GAATTATCTACAGCACTGGTGTAAAAGAAGGGTCTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTA 

GACTTTGCAGGCTATAATATATGGTTOVCACATGAGAACAAGAATAGAGTCATCATGTATTCTTTGTTATTTGCT 

TTTAACAACCTTTAAAAAATATTAAAACGATTCTTAGCTCAGAGCCATACAAAAGTAGGCTGGATTCAGTCCATG 

GACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCT 

ATCTACATAAATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCACGTTTTTTGGCTGACCTGAAAAGAGGTAACT 

TAGTTTTTGGTCACTTGTTCTCCTAAAAATGCTATCCCTAACCATATATTTATATTTCGTTTTAAAAACACCCAT 

GATGTGGCACAGTAAACAAACCCTGTTATGCTGTATTATTATGAGGAGATTCTTCATTGTTTTCTTTCCTTCTCA 

AAGGTTGAAAAAATGCTTTTAATTTTTCS^CAGCCGAGAAACS^GTGCAGCAGTATATGTGCACACAGT^^ 

AAATTTGAGCAACAGTAAGTGCACAAATTCTGTAGTTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAAATTA 

TAGCAATTAACTGGGCATTGTAGAGTATCCT.A?^TATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATA 

TGTGTTCATGTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATAT 

TTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTGG 

TTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAGCTTAT 

AATGATACTGTAGTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATA 

ACTGAAGTTATTTTTATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGA 

CTCAAAGAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGTAAAAATT 

CCAATCAGTCAAAAGAGGTCAATGAATTAAA!VGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAA 



< / us r / s egdb2 / s s t /DNA/Dnas eqs - min/ s s . DNA5 64 3 9 
<subxinit 1 of 1, 747 aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGWLNKDDYIiyDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKIJU:.^ 
HPDKNPJSTNPNAHGDFLKIlSIRAYEVLjEaDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGI 
YDDDPEIITLERREFDAAVNSGELWl^VNFYSPGCSHCHDLAPTWRDFAKEVDGLLRIGAVNC 
GDDRMLCRMKGWSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFVNS 
IQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 
TLEDRIJmHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP 
SLAVFKGQGTKEYEIHHGKKILYDI]:iAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 
PPCRALLPELRRASNLLYGQLKFGT]jDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 
AEQILEFIEDLMNPSWSLTPTTFNELVTQRKHNEVWMVDFYSPWCHPCQVLMPEWKRMART 
LTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 
FLPQVSTDLTPQTFSEKVLQGKNHW/IDFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDC 
QAYAQTCQKAGIRAYPTVKFYFYERAICRNFQEEQINTRDAKAIAALISEKLETLRNQGKRNKDEL 

Important features! 

Endoplasmic reticulian targeting sequence. 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487 



FIGURE 191 



agacagtacctcctccctaggacta<c!acaaggactgaaccagaaggaagaggacagagcazka 
gccatgaacatcatcctagaaatccttctgcttctgatcaccatcatctactcctacttgga 
gtcgttggtgaagtttttcattcctcagaggagaaaatctgtggctggggagattgttctca 
ttactggagctgggcatggaataggic:aggcagactacttatgaatttgcaaaacgacagagc 
atattggttctgtgggatattaataagcgcggtgtggaggi^ctgcagctgagtgccgaaa 
actaggcgtcactgcgcatgcgtatgtggtagactgcagcaacagagaagagatctatcgct 
ctctaaatcaggtgaagaaagaagtgggtgatgtaacaatcgtggtgaataatgctgggaca 
gtatatccagccgatcttctcagcaccaaggatgaagagattaccaagacatttgaggtcaa 
catcctaggacatttttggatcaca;?kaagcacttcttccatcgatgatggagagaaatcatg 
gccacatcgtcacagtggcttcagtgtgcggccacgaagggattccttacctcatcccatat 
tgttccagcaaatttgccgctgttggctttcacagaggtctgacatcagaacttcaggcctt 
gggaaaaactggtatcaaaacctcatgtctctgcccagtttttgtgaatactgggttcacca 
aaaatccaagcacaagattatggcctgtattggagacagatgaagtcgtaagaagtctgata 
gatggaatacttaccaataagaaaatgatttttgttccatcgtatatcaatatctttctgag 
actacagaagtttcttcctgaacgcgcctcagcgattttaaatcgtatgcagaatattcaat 
ttgaagcagtggttggccacaaaatcaaaatgaaatgaataaataagctccagccagagatg 
tatgcatgataatgatatgaatagtttcgaatcaatgctgcaaagctttatttcacattttt 
tcagtcctgataatattaaaaacattggtttggcactagcagcagtcaaacgaacaagatta 
attacctgtcttcctgtttctcaagjkatatttacgtagtttttcataggtctgtttttcctt 
tcatgcctcttaaaaacttctgtgcttacataaacatacttaaaaggttttctttaagatat 

TTTATTTTTCCATTTAAAGGTGGACi^AAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 

tatttacacagggaaggtttaagactgttcaagtagcattccaatctgtagccatgccacag 

AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 
ATCTCAACCTGGACATATTTTAAGAI^TCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 

tcattagcccaaaacggtgcaactct7attctggactttattacttgattctgtcttctgtat 
aactctgaagtccaccaaaagtggaccctctatatttcctccctttttatagtcttataaga 

TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 

taacctttttctttgtaatttatgctttcatatatccttggtcccagagatgtttagacaat 

TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 
CAATGGACCCAAGAGAAGAA 



FIGURE 192 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56409 
<svibunit 1 of 1, 300 aa, 1 stop 
<MW: 33655, pi: 9.31, NX(S/T): 1 

MNIILEILLLLITIIYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQSI 
LVLWDINKRGVEETAAECRKLGWAJfmYVVDCSNREEIYRSLNQVKKEVGDVTIVVNNAGTV 
YPADLLSTKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYC 
SSKFAAVGFHRGLTSELQALGKTGi:EerSCLCPVFVNTGFTKNPSTRLWPVLETDEVVRSLID 
GILTNKKMIFVPSYINIFLRLQKFLPERASAILNRMQNIQFEAWGHKIKMK 

Important features: 
Signal peptide; 

amino acids 1-19 

CAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 

amino acids 165-202, 37-49, 112-122 and 210-219 



FIGURE 19 



CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 
ATGCCCCGGCTCTGCAGGAAGCTGJ^GTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 
AGGATGACCAAGGCCCGGCTGTTCCIGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 
GCTGATCATCGTGTACTGGGACAGCIGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 
CTAGGCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 
GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 
CCTTCCCAGAAAGGAGACGGAGCACJCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 
GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCA6ACCAGGGCCGGCAGCAGGCGGAG 
CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 
CGCATTCGACGACATCCCCAACTCGK3AGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 
CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 
AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 
CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 
CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 
TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 
CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 
GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 
CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 
CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 
AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 
ACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 
GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 
GAGACTGAAAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 
AGTTTTTTTATGACCTACGATTTTG(ZAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
ATTGAGTACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA 



< /us r / s eqdb2 / s s t /DNA/Dnaseiqs . mi n/ s s . DNA5 6112 
<subunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(fJ/T): 4 

MTKARLFRLWLVLGSVFMILL I IVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RSVLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLS 
GSLLHRGAPYRDPLRIPREHVHNAS-AHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPF 
VRLISAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 
ASSWEEDWFAKIPLAWRQQLYKLYEZiDFVLFGYPKPENLLRD 

Important features: 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGPR family cysteine-rich region protein 

amino acids 329-332 



FIGURE 195 



tcgggccagaattcggcacgaggcggtcacgagggcgacggcctcacggggctttggaggtga 
aagaggcccagagtagagagagagagagaccgacgtacacgggatggctacgggaacgcgct 
atgccgggaaggtggtggtcgtgaccgggggcgggcgcggcatcggagctgggatcgtgcgc 
gccttcgtgaacagcggggcccgagtggttatctgcgacaaggatgagtctgggggccgggc 
cctggagcaggagctccctggagctgtctttatcctctgtgatgtgactcaggaagatgatg 
tgaagaccctggtttctgagaccatccgccgatttggccgcctggattgtgttgtcaacaac 
gctggccaccacccacccccacagacjgcctgaggagacctctgcccagggattccgccagct 
gctggagctgaacctactggggacgiacaccttgaccaagctcgccctcccctacctgcgga 
agagtcaagggaatgtcatcaacatcitccagcctggtgggggcaatcggccaggcccaggca 
gttccctatgtggccaccaag6gggc:agtaacagccatgaccaaagctttggccctggatga 
aagtccatatggtgtccgagtcaactgtatctccccaggaaacatctggaccccgctgtggg 
aggagctggcagccttaatgccagac:cctagggccacaatccgagagggcatgctggcccag 
ccactgggccgcatgggccagcccgctgaggtcggggctgcggcagtgttcctggcctccga 
agccaacttctgcacgggcattgaac:tgctcgtgacggggggtgcagagctggggtacgggt 
gcaaggccagtcggagcacccccgtggacgcccccgatatcccttc ctgat ttctctcattt 
ctacttggggcccccttcctaggactctcccaccccaaactccaacctgtatcagatgcagc 

CCCCAAGCCCTTAGACTCTAAGCCGP^GTTAGCAAGGTGCCGGGTCACCCTGCAGGTTCCCAT 
AAAAACGATTTGCAGCC 



FIGURE 196 

< /us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 6 0 4 5 
<subunit 1 of 1, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NX(S/T): 1 

MATGTRYAGKVVVVTGGGRGIGAGIVRAFVNSGARVVICDKDESGGRALEQELPGAVFILCD 
VTQEDDVKTLVSETIRRFGRLDCWl^nJAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 
ALPYLRKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKAIALDESPYGVRVNCISPGN 
IWTPLWEELAALMPDPRATIREGML.R.QPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELGYGCKASRSTPVDAPDIPS 

Important features: 
N-glycosylation site. 

amino acids 13 8-141 

Short-chain alcohol dehydrogenase family protein 

amino acids 10-22, 81-91, 134-171 and 176-185 



FIGURE 197 

AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAATGGACTGGCCTCACAACCTG 
CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAA?^GCAA 
GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 
TGGTGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGA6GAACATCGAGGAGATG 
GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 
GTGGATGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 
GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 
ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 
CCTCTGCCCGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 
CTGTGGGCTGCACCTGCATCTTCTGAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 

ccatcctccttgcacctttgtgcca;!^gaaaggcctatgaaaagtaaacactgacttttgaaa 

GCAAG 



FIGURE 198 

< /usr / seqdb2 / s s t /DNA/Dna seqs . min/ s s . DNA5 9 2 9 4 
<subunit 1 of 1, 180 aa, L stop 
<MW: 20437, pi: 9.58, NX(S/T): 1 

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNI EEMVAQLRNSSEIxAQRKCEVNLQLWMSNKRSLSPWGYS INHDPSRI PVDLPEARCLCL 
GCVNPFTMQEDRSMVSVPVFSQVPVl^RLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-17 

amino acids 96-180. 



FIGURE 199 

GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 
CGAGCGAGGCTGCGGGCCGGCCGCI'GCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 
GCGCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 
CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 
TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 
TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 
TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 
GCCGCTTCTTTGTCACCACTCTCCG?^GCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 
TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATC 
AGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 
TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 
CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 
TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 
GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 
GAGGAAAAAGATGATTCAAATGAAGiy^GAAAACAAAGACAGCCTTGTAGATGATGAAGAAGA 
GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 
CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 
ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 
TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGAC 
TGTAGATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 
CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 
CTCTAGTCATTTGGTCTCATGGCAOTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 
GACAATCAGGATATAGAAAAACAAAGGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 
CAAGTTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATC 
AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAJ^GCCAAGCAGGAGCCTTGGCT 
CCTGAGCATCCCCAAAGTGTAACGTAGAA3CCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 
TTTTGTCAAATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 
ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 
TGCTATGTTTTATTTCTTACCTTTA^TTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 
CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 
TGTGTTTGTTCATTCTGACCTAAGGCJGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 
GACTGCCAAACATCTCAAATGAAAT(3TTGTGGCCATCAGAGACTCAAAAGGAAGTAAGGATT 
TTACAAGACAGATTAAAAAAAAATT(3TTTTGTCCAAAATATAGTTGTTGTTGATTTTTTTTT 
AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 
CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 
TACTTTAATAATAACTAAAAAACCAGTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 
GAATTAATGAACTCCAGTACCTGAAJLGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 
TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 
GCTAATTTCTTT 



FIGURE 200 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56433 
<subunit 1 of 1, 349 aa, 1 stop 
<MW: 38952, pi: 4.34, NX{S/T): 1 

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAP 
WCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRG 
PGIFEDLQl^ILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAW 
CSYVFFVIATLVFGLFMGLVLWISIECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 
DDSNEEENIOJSLVDDEEEKEDLGDEDEAEEEEEEDNIiAAGVDEERSEANDQGPPGEDGVTRE 
EVEPEEAEEGISEQPCPADTEWEDSLRQRKSQHADKGL 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 

Flavodoxin proteins 

amino acids 173-187 



FIGURE 201 



ATCTGGTTGAACTACTTAAGCTTASwTTTGTTAAACTCCGGTAAGTACCTAGCCCACATGATT 
TGACTCAGAGATTCTCTTTTGTCCACAGACAGTCATCTCAGGGGCAGAAAGAAAAGAGCTCC 
CAAATGCTATATCTATTCAGGGGCTCTCAAGj^CA ATGG AATATCATCCTGATTTAGAAAAT 
TTGGATGAAGATGGATATACTCAAI'TACACTTCGACTCTCAAAGCAATACCAGGATAGCTGT 
TGTTTCAGAGAAAGGATCGTGTGCTGCATCTCCTCCTTGGCGCCTCATTGCTGTAATTTTGG 
GAATCCTATGCTTGGTAATACTG6TGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCC 
AGCCCTTGTCCTCCTAATTGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACT 
AAATTCCTGGGATGGAAGTAAAAGACAATGCTGGCAACTGGGCTCTAATCTCCTAAAGATAG 
ACAGCTCAAATGAATTGGGATTTATAGTAAAACAAGTGTCTTCCCAACCTGATAATTCATTT 
TGGATAGGCCTTTCTCGGCCCCAGACTGAGGTACCATGGCTCTGGGAGGATGGATCAACATT 
CTCTTCTAACTTATTTCAGATCAGAACCACAGCTACCCAAGAAAACCCATCTCCAAATTGTG 
TATGGATTCACGTGTCAGTCATTTATGACCAACTGTGTAGTGTGCCCTCATATAGTATTTGT 
GAGAAGAAGTTTTCAATGTAAGAGGAAGGGTGGAGAAGGAGAGAGAAATATGTGAGGTAGTA 
AGGAGGACAGAAAACAGAACAGAAftAGAGTAACAGCTGAGGTCAAGATAAATGCAGAAAATG 
TTTAGAGAGCTTGGCCAACTGTAATCTTAACCAAGAAATTGAAGGGAGAGGCTGTGATTTCT 
GTATTTGTCGACCTACAGGTAGGCTAGTATTATTTTTCTAGTTAGTAGATCCCTAGACATGG 
AATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTTTTGAGATAGGGTCT 
CACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAGCTATCTCTCGC 
CTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAATTTTTGGTG 
TTTTTTGTAGA6ACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGCTTAAG 
TGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCACACCTGGC 
CCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGA 
ATCTTAATTTCTGGCTCTATCAGA6TTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGT 
GTGTTGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTAT6AACTGAGGGTGAAAT 
ATATTTCTGAATAGCTAAATGAAGAAATGGGAAAAAATCTTCACCACAGTCAGAGCAATTTT 
ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTA 
CTTTTTCTTTATCAATTAAATAGCTCAGAGAGTACATCTGCCATATCTCTAATAGAATCTTT 

CACGATCTCGGCTCACCGCAACCTCCGCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 

AGAGACAGGGTTTCTCCATGTCGGTCAGGGTAGTCCCGAACTCCTGACCTCAAGTGATCTGC 
CTGCCTCGGCCTCCCAAGTGCTGGGi!lTTACAGGCGTGAGCCACTGCACCCAGCCTAGAATCT 
TGTATAATATGTAATTGTAGGGAAACTGCTCTCATAGGAAAGTTTTCTGCTTTTTAAATACA 
AAAATACATAAAAATACATAAAATCTGATGATGAATATAAAAAAGTAACCAACCTCATTGGA 
ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 
ACCATTTTTTTCAGTAATTACTGTAi\AATGGTATTATTGGAATGAAACTATATTTCCTCATG 
TGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTATTTTTATTTCCAATGGATA 
TTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTAATATGTGAAAAGAAAT 
TGTACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTTATGTGGATTTCAT 
TTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 202 

</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA53912 
<subunit 1 of 1, 201 aa, 1 stop 
<MW: 22563, pi: 4.87, NX(S/T): 1 

MEYHPDLENLDEDGYTQLHFDSQSNTRIAWSEKGSCAASPPWRLIAVILGILCLVILVIAV 
VLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 
VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL 
CSVPSYS ICEKKFSM 

Important features; 

Type II transmembrane domjiin: 

amino acids 45-65 

CAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids 34-67 and 70-200. 



FIGURE 203 



GGM^GGGGAGGAGCAGGCCa.CaCAGGCAa^GGCCGGTGAGGGACCTGCCCAGACCTGGAGGGTCTCGCTCTGTCa 

CACAGGCTGGAGTGCAGTGGTGTGATCTTCraCTCATCGTi^CCTCCA.CCTCCCGGGTTCAAGTGATTCTCATGCC 

TCAGCCTCCCGAGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAMGACTCCCCAG 

TCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGG 

GAAGACTTTCGCTTCTGCaGCCaGCGGAACICAGACACACAGGAGCAGCCTCCACTACAAACCaVC^^ 

CGCATCTCCATCGAGaACTCCGAAGAGGCC:CTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGa. 

TCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTC 

TATGGCAAGCGTGACTTCTTGCTGAGTGACiAAAGCCTCTAGCCTCCTCTGCTTCCaGCACCAGGAGGAGAGCCTG 

GCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCC 

GCC^GCTTCACCTTCTCCTTCCACAGTCCl'CCCCyVCACGGCCGCTCACAATGCCTCGGTGGACATGTGC 

AAAAGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGC^ 

GCCAGCCAGCAGTTGCaGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCGAG 

GAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAG6ACCTGCACATCCACTCCCGG 

CAGGAGGAGGAGCAGAGCGAGATCATGQAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGC 

CGGAGCGGGQAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCC 

AGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCCCGTG 

GTGCTCACTTTCCAGCACCS^GCTACAGCCG^AAGAATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACA 

TTGAGCAGCCCGGGGa^TTGGAGCaGTGCTGGGTGTGAGACCGTC:a.GGAQAGAAACCCAAACATCCTGCTTCTGC 

AACCACTTGACCTACTTTGCAGTGCTGATGK3TCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCT6AGC 

CTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCaTTGCCGCCTACCTCTGCTCCAGG 

GTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCa.TCAAGGTGCACATG?iACCTGCTGCTGGCCGTCTTC 

CTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTQACAGGCTCTGAGGCTGGCTGCCX3AGCCAGTGCC 

ATCTTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTG 

GTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTT 

CTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAG 

GGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGC 

CTGGTGTTTCTGTTCAACATGGCCATGCTAGCCa.CCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACaCCCS^ 

AAGTGGTCACATGTGCTGAa^CTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCC 

TTTGCTTCTGGCaCCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCaTCACCTCCTTCCAAGGC^ 

rrCATCTGGTACTGGTCCATGCGGCTGCaGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGa^CTCAGAGAG^ 

AGGCTCCCCATCAGCTCGGGCAGCSiCCTCGTCCAGCCGCS^TCTAGGCCTCCSiSCCC^ 

CAGAGATGCGGCCrCGTCGCa.CACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCA<^ 

TTGGa2^GCCCaACGACaVTGGAGAGATGG(3Ca3TTGCCaTGGTGGACGGACTCCCGGGCTGGGCTTTTG^ 

GCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTG 

TCCCCACATCTGTCCCAACCCa^GCTGGAGGCCTGGTCTCTCCTTACaACCCCTGGGCCCAGCCCTCA 

GGCCAGGCCTTGGATCrTGAGGGTCTGGCa.t3ATCCTTAATCCTGTGCCCCTGCCTGGGACAGA2^TGTGGCTCCA 

GTTGCTCTGTCTCTCGTGGTCACCCTGAGaaaVCTCTGCATCCTCTGTCaTTTTAACCTCAGGTGGCACCCAGGG 

CGAATGGGGCCC:a^GGGCAGACCTTCAGGGC<:a6AGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCa.CAGC^ 

AGCTCGCCTACCTCTGAGCCCAGGCCCCCT(:CCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTC 

TCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCCAATGCTGTTTTTTGGGGA 

GTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCA 

GGCTCGGTACCGATGCGTGGGCTGGGCTAGC5TCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTG 

CTCACCCTGACC3^GC3iCAa3CCTCAGAGGCKX:CCTCAGCCTCTCCTGAAGCCCTCTTGTGGCaAGAACTGTGGA 

CCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGACACTGGCCTA 

GAGCCTGACACTCTCCTAAGAGGTTCTCTaiAAGCCCCCAAATAGCTCCAGGCGCCCTCGGCCGCCCATCATGGT 

TAATTCTGTCCAACAAACAOVCACGGGTAGJiiTTGCTGGCCTGTTGTAGGTGGTAGGGACACAGATG^ 

GTCACTCCTCCTGCCaACATTa^GTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTAC^^GGGAa^ 

GGGAGCCATCATTCCTGCCTGGGAATCCTGCa^GACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGAAGAT 

GGGAAGGATGTTCTTTTTACGTACCAATTOTTTTGTCTTTTGATATTAAAAAGAAGTACATGTTCATTGTAGAGA 

ATTTGGAAACTGTAGAAGAGAATCAAGAAGISAAAATAAAAATCaGCTGTTGTAATCGCCTAGCAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAA2iAAAAAAAAAA?AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 204 

</usr/seqc5b2/ sst/DNA/Dnaseqs .min/ss .DNA50921 
<subianit 1 of 1, 693 aa, 1 stop 
<MW: 77738, pi: 8.87, NX(S/T): 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATWKLQPTAGLQD 
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 

kvx.giwqntkvanltepvvltfqh;qlqpkitvtlqcvfwvedptlsspghwssagcetvrre 

TQTSCFCNHLTYFAVLMVSSVEYDAVHKHYLSLLSYVGCWSALACLVTIAAYLCSRVPLPC 

rrkprdytikvhmnlllavflldtsfllsepvaltgseagcrasaiflhfslltclswmgle 

GYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIY 
PSMCWIRDSLVSYITNLGLFSLVFLFlSnyiAMLATiyrTVQILRLRPHTQKWSHVLTLLGLSLV^ 

LPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 
ISSGSTSSSRI 

Important features: 

Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C- terminal tairgeting signal. 

amino acids 691-693 

CAMP- and cGMP- dependent protein kinase phosphorylation sites, 
amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 



FIGURE 205 

TGCCTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 
GGAAANCNTCGGGACTACACCNTCAAGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTG 
GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 
TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 
GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 
TGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 
TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 
GGTGTTTCTGTTCAACATGG 



FIGURE 206 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCAGGTTTTGCTTTGA 

TCCTTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAJ^GTTTTGGATGGGATTATGTGGAAACTACCCT 

GCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCCACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAB^^ 

TCGGGAGTCGCTGCTTCCaAAGTGCCCGCCGTGAGTGAGCTCTCACCCCAGTC:AGCCAj ATGA .GCCTCTTCGGGC 

TTCTCCTGCTGACATCTGCCCTGGCCGGCCAGAGS^CAGGGGACTCAGGCGGAATCCaACCTQaGTAGTAAATTCC 

AGTTTTCCAGCAACT^GGii^CAGAACGGAGTACAAGATCCTCAGCATGAGAGAATTATTACTGTG 

GAAGTATTCACAGCCCAAGGTTTCCTO^TACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTAG 

AGGAAAATGTATGGATACa^CTTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGT 

ATGATTTTGTAGAAGTTGAGGAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAG 

GAAAACaGATTTCTAAAGGAAATCAAATTA(3GATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGT 

TCTGCATCCACTACAACATTGTCATGCGaai^TTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTC^^ 

TGCCACTGGACCTGCTTAATAATGCTATAACrTGCCTTTAGTACCTTGGAAGACCTTATTCGATATCTTGAACCAG 

AGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCO^CTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAA 

GAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAQAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCT 

CAGTGTCCATAAGGGAAGAAGTAAAGAGAACCGATACa^TTTTCTGGCa^GGTTGTCTCCTGGTTAAACGCTGTG 

GTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAJSvATACC 

ACGAGGTCCTTCaGTTGAGACCAAAGACCGCSTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGC 

ACCATGAGGAGTGTGACTGTGTGTGCAGAG(3GAGCACAGGAGG ATAG CCGCaTCa.CCACCa^GCAGCTCT 

GAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCT 

TCAAGGACCTTTCaiTCTTa^GGATTTACAGTG(:a.TTCTGAAAGAGGAGACATCAAACaGAAOT 

ACAGCTCTTTTGAGAGGAGGCCTAAAGGAa;vGGAGAAAAGGTCTTa!kATCGTGGAAAGAAAATTAAATGTT 

TAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTC 

GATACGGCTTAGGGTAATGTCAGTACAGGAJiAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 

TCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCACAT 

GTCATGCTGATAGGACAGAOTGQATTTTTC2iTATTOCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAAC^^ 

TTCS^TGGTTTGGAAGAGATAAACCTGAAAAGlAAGAGTGGCCTTATCTTCACTTTATCGATAAGTaiGTTTATTTG 

TTTCATTGTGTACATTTTTATATTCTCCTTTTGAO^TTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCT 

ATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACS^ACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCT 

AAACyiCAATTGTTATAGCCAGAGGAACAAACiATGATATAAAATATTGTTGCTCTGACaAAAATACaTGTATTTC^ 

TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAA 

GACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAA 

AAGTAGACATTCAGATCCa.GCGATTACTAACCTATTCei^TTTTGGGGAAATCTGAGCCTAGCTC^ 

AAAGCACCTTGAAAAAGACTTGGC:aGCTTCC"rGATAAAGCGTGCTGTGCTGTGCa.GTAGGAACACa^TCCTATTTA 

TTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAATACIATGGATATTTTTATGTACA 

GAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGCaUiTTTAAAAGAAAATCAGTAAAATATTTTGCTTGT 

AAAATGCTTAATATOGTGCCTAGGTTATGTGGTGACTATTTGAATCaAAAATGTATTGAATCATCAAATAAAAGA 

ATGTGGCTATTTTGGGGAGAAAATTAAAAAAaAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 



MSLFGLLLLTSALAGQRQGTQAESNjLiSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPR 
FPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTILGRWC 
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLY 
SCTPRNFSVS IREELKRTDTI FWPGGLLVKRCGGNCACCI.HNCNECQCVPS KVTKKYHEVLQ 
LRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



Signal sequence: 

amino acids 1-14 



CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTCTGCTCTCTTCAACCAGaCCTCTACATTCCATTTTGGAA 

AGACTAAAAATGGTGTTTCO^TGTGGACACTGAAGAGACaAATTCTTATCCTTTTTAAa^ 

AAACTCCTTGGGGCTAGATGGTTTCCTAAAACTCTGCCCTGTGATGTCACTCTGGATGTTCCAAAGAACCATGTG 

ATCGTGGACTGCACAGACAAGCATTTGACAGAAATTCCTGQAGGTATTCCCACGAACaCCACGAACCTCACCCTC 

ACCa.TTAACCACATACCaGACATCTCCCCAGCGTCCTTTCACAQACTGGACC:S^TCTGGTAGAGATCGATTT<^ 

TGC:aACTGTGTACCTATTCC:AC:TGGGGTCA?W^CAACATGTGCATCSAGAGGCTGCaGA^ 

TTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACCAGCTACTAGAGATACCGCAGGGCCTCCCG 

CCTAGCTTACAGCTTCTCAGCCTTGAGGCCAACAACykTCTTTTCCATCAGAAAAGAGAATCTAACAGAACTG^^ 

AACATAGAAATACTCTACCTGGGCCAAAACTGTTATOATCQAAATCCTTGTTATGTTTCATATTCAATAGAGAAA 

GATGCCTTCCTAAACTTGACAAAGTTAAAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCTACTGTT 

TTGCCATCTACTTTAACAGAACTATATCTCTACAACAACATGATTGCAAAAATCCAAGAAGATGATTTTAATAAC 

CTCAACCAATTACAAATTCTTQACCTAAGTGGAAATTGCCCTCGTTGTTATAATGCCCCATTTCCTTGTGCGCCG 

TGTAAAAATAATTCTCCCCTACAGATCCCTGTAAATGCTTTTGATGCGCTGACAGAATTAAAAGTTTTACGTCTA 

CACAGTAACTCTCTTCAGCATGTGCCCCCAAGATGGTTTAAGAACATCAACAAACTCCAGGAACT 

CAAAACTTCTTGGCO^GAAATTGGGGATGCTAAATTTCTGCATTTTCTCCCCAGCCTCATCa^TTGGATCTG 

TCTTTCAATTTTGAACTTCAGGTCTATCGTGCaTCTATGAATCTATCACAAGCATTTTCTTCACTGaAAAGCCTG 

AAAATTCTGCGGATCAGAGGATATGTCTrasLAAGAGTTGAAAAGCTTTAACCTCTCGCCATTACATAATCTTCAA 

AATCTTGAAGTTCTTGATCTTGGCACTAACTTTATAAAAATTGCTAACCTCAGCATGTTTAAACAATTTAAAAGA 

CTGAAAGTCATAGATCTTTG?IGTGAATAAAATATCACCTTCAGGAGATTCAAGTGAAGTTGGCTTCTGCTCAAAT 

GCCAQAACTTCTGTAGAAAGTTATGAACCC'CyiGGTCCTGGAACAATTA(:yi.TTATTTCAGATATGATAAGTATGC7^ 

AGGAGTTGCAGATTCAAAAACAAAGAGGCTTCTTTCATGTCTGTTAATGaAAGCTGCTACaAGTATGGGCAG^^ 

TTGGATCTAAGTAAAAATAGTATATTTTTTGTCS^GTCCTCTGATTTTCAGCATCTTTCTTTCCTC»L?^^ 

AATCTGTCAGGA?^TCTCATTAGCCAAACTCTTAATGGCAGTGAATTCCAACCTTTAGCAGAGCT^ 

GACTTCTCCaAC^CCGGCTTGATTTACTC'CATTCAACAGCATTTGAAGAGCTTCACaAACTGGAAGTTCTGG^ 

ATAAGO^GTAATAGCCATTATTTTCAATCAiSAAGGAATTACTCATATGCTAAACTTTACCS^GAACCTAAAGGTT 

CTGCAQAAACTGATGATGAACGACAATGAa^TCTCTTCCTCCACCAGCAGGACCATGGAGAGTGAGTCTCTTAGA 

ACTCTGGAATTCAGAGGAAATCACTTAGATGTTTTATGGAGAGAAGGTGATAACAGATACTTACAATTATTCAAG 

AATCTGCTAAAATTAGAGGAATTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGT 

ATGCCTCCAAATCTAAAGAATCTCTCTTTG<3CCAAAAATGGGCTCyVAATCTTTCAGTTGGAAGAAACTCCaL^ 

CTAAAGAACCTGGAAACTTTGGACCTCAGC(2Aa^CC2^CTGACCACTGTCCCTGAGAGATTATCCAACTGTTCC 

AGAAGCCTCAAGAATCTGATTCTTAAGAATi^TCAAATCAGGAGTCTGACGAAGTATTTTCTACAAGATGCCTTC 

CAGTTGCGATATCTGGATCTCaGCTCAAATjyiAATCCAGATGATCCaAAAGACCAGCTTCCCaG^ 

GTTAACCATACGGAGGTGACTATTCCTTACCn'GGCCACAGATGTGACTTGTGTGGGGCCAGGAGCACACAAGGGC 
CAAAGTGTGATCTCCCTGGATCTGTACACCTGTGAGTTAGATCTGACTAACCTGATTCTGTTCTCACTTTCCATA 
TCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCACOTCTATTTCTGGGATGTGTGGTATATTTACCAT 
TTCTGTAAGGCCAAGATAAAGGGGTATCAGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTAT 
GACACTAAAGACCCAGCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAA 
CATTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTTCCCA6AGCATA 
CAGCTTAGCaAAAAGACAGTGTTTGTGATSlCAGACS^GTATGCAAAGACTGAAAATTTTAAGATAGCATTTT^^ 
TTGTCCCyVTCyiGAGGCTCATGGATGAA2^GTTGaTGTGATTATCTTGATATTTCTTGAGAAGCCCTTTCAGAAG 
TCCAAGTTCCTCCAGCTCCGGAAAAGGCTCTGTGGGAGTTCTGTCCTTGAGTGGCCAACaAACCCGCAAGCTC^^ 
CCATACTTCTGGCa^GTGTCTAAAGAACGCCCTGGCCACAGACAATCATGTGGCCTATAGTCAGGTGTTCAAGGAA 
ACGGTC^AGCCCTTCTTTGCaAAACACAACTGCCTAGTTTACCAAGGAGAGGCCTGGC 



FIGURE 209 



MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGG 

I PTNTTNDTLTINHI PD I S PASFHRLDHLVE IDFRCNCVP I PLGSKNNMCIKRLQIKPRSFS 

GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYR 

NPCYVSYSIEKDAFLNLTKLKVLSLKDimVTAVPTVLPSTLTELYLYKmi^ 

NQLQILDLSGNCPRCYNAPFPCAPCEasrNSPLQIPVNAFDALTELKVLRLHSNSLQHVPPRWF 

KNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 

KI LRI RGYVFKELKS FNLS PLHNLQNLE VLDLGTNF IKI ANLSMFKQFKRLKVIDLSVNKI S 

PSGDSSEVGFCSNARTSVESYEPQV]jEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQ 

TLDLSKNSIFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLH 

STAFEELHKLEVLDISSNSHYFQSEGITHMmFTKNLKVLQKLMMNDNDISSSTSRTMESES 

LRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSL 

AKNGLKSFSWKKLQCLKNLETLDLSmQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 

DAFQLRYLDLSSNKIQMIQKTSFPElTV-LmLKMLLLHHNRFLCTCDAWFVWWVNHTEVTIP 

YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVW 

YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEE 

RDWLPGQPVLENLSQSIQLSKKTVFWTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 

KPFQKSKFLQLRKRJiCGSSVLEWPTlTPQAHPYFWQCLKNALATDNHVAYSQW 

Signal sequence: 

amino acids 1-26 

Transmembrane domain: 

amino acids 840-860 



FIGURE 210 



GGGTACCa.TTCTGCGCTGCTGCaAGTTACGG?^TG&AAAATTAGAACJ^CAGAAA CATGG AflAa.CaTGTTCCTTC 

AGTCGTCAATGCTGACCTGCATTTTCCTGCT.aATATCTGGTTCCTGTGAGTTATGCGCCGAAGAAAATTTTTCTA 

GAAGCTATCCTTGTGATGAGAAAAAGCaAAATGACTCAGTTATTGCAGAGTGCaVGCAATCGTCGACTACaGGAAG 

TTCCCCaAACGGTGGGCAAATATGTGACy^GAACTAGACCTGTCTGATAATTTCATCACACACATAACGAATGAAT 

CATTTCaAGGGCTGCAAAATCTCaCTAAAAT.Z^TCTAAACCACAACCCCAATGTACAGC:^ 

CCGGTATACAATCAAATGGCTTGAATATCACAGACGGGGCATTCCTCAACCTAAAAAACCTAAGGGAGTTACT 

TTGAAGACAACCAGTTACCCCAAATACCCTCTGGTTTGCCSIGAGTCTTTGACAGAACTTAGTCTAATTGR^^ 

ATATATACAACATAACTAAAGAGGGCATTTCAAGACTTATAAACTTGAAAAATCTCTATTTGGCCTGGAACTGCT 

ATTTTAACAAAGTTTGCQAGAAAACTAACATAGAAGATGGAGTATTTGAAACGCTGACyiAATTTGGAGTTGCTAT 

CACTATCTTTCAATTCTCTTTCACACGTGCC^CCCAAACTGCCAAGCTCCCTACGCAAACTTTTTCTGAGC^ 

CCCAGATCAAATACATTAGTGAAGAAGATTTCAAGGGATTGATAAATTTAACATTACTAGATTTAAGCGGGAACT 

GTCCGAGGTGCTTCaATGCCCCATTTCCATGCGTGCCTTGTGATGGTGGTGCTTCAATTAATATAGATCGTTTTG 

CTTTTCAAAACTTGACCCAACTTCGATACCTAAACCTCTCTAGCACTTCCCTCAGGAAGATTAATGCTGCCTGGT 

TTAAAAATATGCCTCATCTGAAGGTGCTGGATCTTGAATTCAACTATTTAGTGGGAGAAATAGTCTCTGGGGCAT 

TTTTAACGATGCTGCCCCGCTTAGAAATACTrGACTTGTCTTTTAACTATATAAAGGGGAGTTATCCACAGCATA 

TTAATATTTCCAGAAACTTCTCTAAACTTTTGTCTCTACGGGCATTGCATTTAAGAGGTTATGTGTTCCAGGAAC 

TCAGAGAAGATGATTTCCAGCCCCTGATGCAGCTTCCAAACTTATCGACTATCAACTTGGGTATTAATTTTATTA 

AGCAAATCGATTTCaAACTTTTCCAAAATTTCTCCAATCTGGAAATTATTTACTTGTCAGAAAACAGAATATCAC 

CGTTGGTAAAAGATACCCGGCAGAGTTATGCAAATAGTTCCTCTTTTCSyiCGTCATATCCGGAAACGACGCTCAA 

CAQATTTTGAGTTTQACCCy^CATTCGAACTTTTATCATTTCACCCGTCCTTTAATAAAGCCyiCAATGTGCTGCTT 

ATGGAAAAGCCTTAGATTTAAGCCTCAACAGTATTTTCTTCATTGGGCCaAACCaATTTGAAAATCTTCCTGAC^ 

TTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCAAGTGTTAAGTGGAACTGAATTTTCAGCCATTCCTCATG 

TCAAATATTTGGATTTGACAAACAATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGAATTGTCCGACTTGG 

AAGTTCTAGATCTCT^GCTATAATTCACACTATTTCAGAATAGCAGGCGTAACACATCATCTAGAATTTATTCAAA 

ATTTCACAAATCTAAAAGTTTTAAACTTGAGCCACaACaACaTTTATACTTTAACAGATAAGTAT^ 

GCAAGTCCCTGGTAGAATTAGTTTTCAGTGGCAATCGCCTTGACa.TTTTGTGGAATGATGATGACAAC^ 

TCTCCyiTTTTCAAAGGTCTCAAGAATCTGACACGTCTGGATTTATCCCTTAATAGGCTGAAGCACATCCa^ 

AAGCATTCCTTAATTTGCCAGCQAGTCTCACTGAACTACATATAAATGATAATATGTTAAAGTTTTTTAACTGGA 

CATTACTCCAGCAGTTTCCTCGTCTCGAGTTGCTTGACTTACGTGGAAACAAACTACTCTTTTTAACTGATAGCC 

TATCTGACTTTACSITCTTCCCTTCGGACACTGCTGCTGAGTCATAACAGGATTTCCCACCTACCCTCTGGCTTTC 

TTTCTGAAGTCAGTAGTCTGAAGCACCTCGArTTAAGTTCCAATCTGCTAAAAACAATCaACaAATCCGCACT 

AAACTAAGACO^CCACCAAATTATCTATGTTiSGAACTACACGGAAACCCCTTTGAATGCACCTGTGACATTGGAG 

ATTTCCGAAGATGGATGGATGAACATCTGAATGTCAAAATTCCCAGACTGGTAGATGTCATTTGTGCC^ 

GGGATCAAAGAGGGAAGAGTATTGTGAGTCTGGAGCTAACaACTTGTGTTTCAGATGTCACTGCU^GTGAT^ 

TTTTCTTCACGTTCTTTATCACCACCATGGTTATGTTGGCTGCCCTGGCTCACCATTTGTTTTACTGGGATGTTT 

GGTTTATATATAATGTGTGTTTAGCTAAGGTAAAAGGCTACAGGTCTCTTTCCACM?CCCAAACTTTCTATGATG 

CTTACATTTCTTATGACACCAAAGATGCCTCTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAG 

AGAGCCGAGACAAAAACGTTCTCCTTTGTCTAGAGGAGAGGGATTGGGACCCGGGATTGGCCATG^TCGACa^ 

TCATGCAGAGCATCAACCaAAGCaAGAAAAGAGTATTTGTTTTAACCAAAAAATATGCAAAAAGCTGG;^ 

AAACAGCTTTTTACTTGGCTTTGCAGAGGCTAATGGATGAGAAGATGGATGTGATTATATTTATCCTGCTGGAGC 

CAGTGTTACAGCATTCTCAGTATTTGAGGCTACGGCa.GCGGATCTGTAAGAGCTCCATCCTCCAGTGGCCTGACA 

ACCCGAAGGCAGAAGGCTTGTTTTGGCAAACTCTGAGAAATGTGGTCTTGACTGAAAATGATTCACGGTATAACA 

ATATGTATGTCGATTCCATTAAGa^TACmACTGACGTTAAGTCATQATTTCGCGCCATAATAAAGATGCAAAG 

GAATGACATTTCTGTATTAGTTATCTATTGCTATGTAAOWVTTATCCaUlAACTTAGTGGTTTAAAAC^ 

TTTGCTGGCCCA(:a.GTTTTTGAGGGTCAGGAGTCCAGGCCC:AGCATAACTGGGTCCTCTGCTCAGGGTGTCTa^ 

AGGCTGCAATGTAGGTGTTCACCAGAGACATAGGCATCACTGGGGTCACACTa^TGTGGTTGTTTTCTGGATTC^ 

ATTCCTCCTGGGCTATTGGCCAAAGGCTATACTaVTGTAAGCCATGCGAGCCTCTCCCACAAGGCAGCTTGCTTC 

ATCAGAGCTAGCAAAAAAGAGAGGTTGCTAGCS^GATGAAGTCACAATCTTTTGTAATCGAATCAAAAAAG 

lATCTCATCaCTTTGGCCATATTCTATTTGTTAGAAGTAAACCACAGGTCCCACCAGCTCCATGGGAGTGACCACC 

TCAGTCCAGGGAAAACAGCTGAAGACCa^GATGGTGAGCTCTGATTGCTTCAGTTGGTCATCAACTATT^ 

TGACTGCTGTCCTGGGATGGCCTGCTATCTTGATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACC 

ATCTTAGCAGTTGACCTAACACATCTTCTTTTCAATATCTAAGAACTTTTGCCACTGTGACTAATGGTCCTAATA 

TTAAGCTGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCGGTTTTAT 

TTACAGTTGCTTTTACAAATATTTGCTGTAAi^ATTTGACTTCTAAGGTTTAGATGCCATTTAAGAACTGAGATGG 

ATAGCTTTTAAAGCATCTTTTACTTCTTACGATTTTTTAAAAGTATGCAGCTAAATTCGAAGCTTTTGGTCTATA 

TTGTTAATTGCCATTGCTGTAAATCTTAAAATGAATGAATAAAAATGTTTCATTTTACAAAAAAAAAAAAAAAA 
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MENMFLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG 
KYVTELDLSDNFITHITNESFQGLQWLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNL 
KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV 
CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 
TLLDLSGNCPRCFNAPFPCVPCDGGAS INIDRFAFQNLTQLRYLNLS STSLRKINAAWFKNM 
PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL 
HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 
KDTRQS YANSS SFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNS I FFI 
GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 
LDLSYNSHYFRIAGVTHHLEFIQNF-rNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 
DILWNDDDNRYISIFKGLKNLTRLDLSLNRLKHIPNEAFLNLPASLTELHINDNMLKFFNWT 
LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 
SNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASP 
GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMLAAIAHHLFYWDVWFIYNVCLAKVK 
GYRSLSTSQTFYDAYISYDTKDASWDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 
NLMQSINQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVIIFILLEPVLQHSQYLRL 
RQRI CKS S ILQWPDNPKAEGLFWQTIiRNVVLTENDSRYlSrNMYVDS I KQY 

Signal sequence: 

amino acids 1-26 

Transmembrane domain : 

amino acids 826-848 



CCAGGTCCAACTGCACCTCGGTTd'ATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAJ^GCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCC:CAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGG'ACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 
GCAGCTCCTGCCCCTGTCCGGGGGA.TGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 
GGCCACCCCGCCTGGAGGCACAGGC;CATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 
CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 
CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 
CCACCTGCGACGGGCACCGGGCCTGiCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 
CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 
CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 
GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 
GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 
CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 
CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 
GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 
CCCACTGCACAGCCTGGCCTCGCAGiGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 
TGGTGCACTCCTTCCA6CAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 
GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTG 
GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 
AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 
TTCCTCGGGAGGCTCCCCAGACCCTSGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 
CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 
AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 
GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGT 
CGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAAT 
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MRGSQEVLLMWLLVIAVGGTEHAYIIPGRRVCAVRAHGDPVSESFVQRWQPFLTTCDGHI^ 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQI SFLEEQLGS CSCKKDS 

Signal sequence: 

1-19 



i 
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GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAG 
GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AGCAGCATCAGAGCAGCCCCTGTGGfTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGA 
GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGG6CTGAGGTCTCCTCATCTTCTCCCTAGC 
AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 
AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCCCCCAGGACCGGGGAGG 
CACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC 
TCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGQCACAGGCC ATGA GGGGC 
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 
CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTC6 
TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 
CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 
CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 
GGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTG 
TCCCCAGCGCTGCATCAACACCGCCC3GCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 
TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCC6 
ACAGGAGTGGACAGTGCAATGAAGGJiiAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCT 
GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 
ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 
TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 
CTCGTGACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
CCCCTGCCCAACATGCTGGGGGTCCi^GAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 
AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 
GGGCTGGGATCTTCTCTGTGAATCCfi.CCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 
TCCCAAGGCCAGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGAC 
CCATGGCACAGGCCAGGCAGCCCGGA.GGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 
CCCCAGCACAATAAAAATGAAACGTG 
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MRGSQEVLIiMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKECDS 



Signal sequence: 
1-19 
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CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA 
G6TGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 
TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 
CAGCAGCCCCCAGGACCGGGGAGGC2\.CAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 

ccctgtccgggggatgactgattctcctccgccaggccacccagaggagaaggccaccccgc 
ctggaggcacaggcc atga ggggctctcaggaggtgctgctgatgtggcttctggtgttggc 
agtgggcggcacagagcacgcctacc:ggcccggccgtagggtgtgtgctgtccgggctcacg 
gggaccctgtctccgagtcgttcgtgcagcgtgtgtaccagcccttcctcaccacctgcgac 
gggcaccgggcctgcagcacctaccgjaaccatctataggaccgcctaccgccgcagccctgg 
gctggcccctgccaggcctcgctacgcgtgctgccccggctggaagaggaccagcgggcttc 
ctggggcctgtggagcagcaatatgccagccgccatgccggaacggagggagctgtgtccag 
cctggccgctgccgctgccctgcaggatggcggggtgacacttgccagtcagatgtggatga 
atgcagtgctaggaggggcggctgtccccagcgctgcgtcaacaccgccggcagttactggt 
gccagtgttgggaggggcacagcctgtctgcagacggtacactctgtgtgcccaag6gaggg 
ccccccagggtggcccccaacccgacaggagtggacagtgcaatgaaggaagaagtgcagag 
gctgcagtccagggtggacctgctggaggagaagctgcagctggtgctggccccactgcaca 
gcctggcctcgcaggcactggagcatgggctcccggaccccggcagcctcctggtgcactcc 
ttccagcagctcggccgcatcgactccctgagcgagcagatttccttcctggaggagcagct 
ggggtcctgctcctgcaagaaagactcgtgactgcccagcgccccaggctggactgagcccc 
tcacgccgccctgcagcccccatgcccctgcccaacatgctgggggtccagaagccacctcg 
gggtgactgagcggaaggccaggcagggccttcctcctcttcctcctccccttcctcgggag 
gctccccagaccctggcatgggatgggctgggatcttctctgtgaatccacccctggctacc 
cccaccctggctaccccaacggcatcgcaaggccaggtgggccctcagctgagggaaggtac 
gagctccctgctggagcctgggaccci^tggcacaggccaggcagcccggaggctgggtgggg 
cctcagtgggggctgctgcctgacccccagcacaataaaaatgaaacgtg 
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MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTS6LPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCVNTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 

Signal sequence: 
1-19 
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ggttgccaovgctggtttagggccccgacaivctggggccccttgtcaggaggagacagcctcccggcccgggc^ 
gacaagtcgctgccacctttggctgccgacgtgattccctgggacggtccgtttcctgccgtcagctgccggccg 
agttgggtctccgtgtttcaggccggctcccccttcctggtctcccttctcccgctgggccggtttatcgggagg 
agattgtcttccagggctagcaattggacttttgatgatgtttgacccagcggcaggaatagcaggcaacgtqat 
ttcaaagctgggctca^gcctctgtttcttctctcgtgtaatcgcaaaacccattttggagcaggaattccaatca 
tgtctgtgatggtggtgagaaagaaggtgaca^cggaaatgggagaaactcccaggcaggaacaccttttgctgtg 
atggccgcgtcatgatggcccggcaaaagg(3c7kttttctacctgacccttttcctcatcctggggaca^ 
tcttcttcgcctttgagtgccgctacctggctgttcagctgtctcctgccatccctgtatttgctgccatgctct 
tccttttctccatggctacactgttgaggaccagcttcagtgaccctggagtgattcctcgggcgctaccagatg 
aagcagctttcatagaaatggagatagaag(:taccaatggtgcggtgccccagggccagcgaccaccgcctcgta 
• tcaagaatttccagataaacsaccagattgtgaaactgaaatactgttacacatgcaagatcttccggcctcccc 
gggcctcccattgcagcatctgtgacaactgtgtggagcgcttcgaccatcactgcccctgggtggggaattgtg 
ttggaaagaggaactaccgctacttctacctcttcatcctttctctctccctcctcacaatctatgtcttcgcct 
tcaacatcgtctatgtggccctcaaatctttgaaaattggcttcrtggaga(:a.ttgaaagaaactcctggaactg 
ttctagaagtcctcatttgcttctttacactctggtccgtcgtgggactgactggatttcatactttcctcgtgg 
ctctcaaccagacaaccaatgaagacatca^^gqatcatggaaigggaagaatcgcgtccagaatccctacagcc 
atggcaatattgtgaagaactgctgtgaagtgctgtgtggccccttgccccccagtgtgctggatcgaaggggta 
ttttgccactggaggaaagtggaagtcgacctcccagtactcaagagaccagtagcagcctcttgccacagagcc 
c^gcccccacagaacacctgaactcs^tgjvgatgccggaggacyigcagcactcccgaag^ 

taattagggctatgagagatttcaggtgagzlagttaaacctgagacagagagcaagtaagctgtcccttttaact 
gtttttctttggtctttagtcacccagttgciacactggcattttcttgctgcaagcttttttaaatttctgaact 
caaggcagtggca.gaagatgtcagtcacctctqataactggaaaaatgggtctcttgggccctggc:actggttct 
ccatggcctcagccacy^gggtccccttggaccccctctcttccctccagatcccagccctcctgcttggggtcac 
tggtctcs^ttctggggctaaaagtttttgagactggctcaaatcctcccaagctgctgcacgtgctgagtccaga 
ggcagtcacagagacctctggccaggggatcctaactgggttcttggggtcttcaggactgaagaggagggagag 
tggggtcagaagattctcctggccacct^gtgccagcattgcccacswvatccttttaggaatgggacaggtacct 

TCCaCTTGTTGTANNNNNNNNNl^^ 

CAGGAATGGC^GTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACTTA^ 

ACTATCCCCTCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGGAAACa^GGCACAGAGAG 

GCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAAAAAAATGCTAACCAGTTCTTCCATTAAGCCT 

CGGCTGAGTQAGGGAAAGCCCAGCACTGCTGCCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCT 

ATGGTAACCACACTGGGGGCTTCCTCCAAGCCCCGCTCTTCCAGCACTTCCACCGGCAGAGTCCCAGAGCCACTT 

c:y^CCCTGGGGGTGGGCrGTGGCCCCCAGTC^kGCTCTGCTCSVGGACCTGCTCTATTTCa.GGGAAGAAGATTTATO^ 

ATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAGGGTCCTGTCTGGATGACTTAT 

GCGGTGGGGGAGTGTAAACCGGAACTTTTCffi.TCTATTTGAAGGCGATTAAACTGTGTCTAATGCA 
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MSVMVWKKVTRKWEKLPGRNTFCCDGRVMMARQKGI FYLTLFLI LGTCTLFFAFECRYLAV 
QLSPAIPVFAAMLFLFSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPPP 
RIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYL 
FILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTF 
LVALNQTTNEDI KGSWTGKNRVQNPYSHGNI VKNCCEVLCGPLPPSVLDRRGI LPLEESGSR 
PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 



Putative transmembrane domains: 

amino acids 36-55 (type II TM) , 65-84, 188-208, 229-245 



FIGURE 220 

AAAACCCTGTATTTTTTACAATGCAi^TAGACAATNANCCTGGAGGTCTTTGAATTAGGTAT 
TATAGGGATGGTGGGGTTGATTTTTirrTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 
CCCACAGAGCNCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTA 
CCGCTANTTCTACCTCTTCATCCTT'rNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 
ACATCGT 
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GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 
TCTATCAGGAAAGAAAGAAAGAAAA:!y^CCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGA 
AAAAAAATCATGAAAACCATCCAGCCAAAAATGCACTiATTCTATCTCTTGGGCAATCTTCAC 
GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 
CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 
GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 
CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 
AGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 
CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 
TTCAGATATCTCCATTAATGAAGGGJACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 
CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 
GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 
CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 
TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 
TCAGCAGTCCCCTCAGCAGAATTCCA.GTGGTACAAGGATGACAAAAGACTGATTGAAGGAAA 
GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 
AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 
ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 
CTQCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTT TGAT GTGAGTGCC 
ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 
CGACAGCAACCAATCAGATATATACAilATGAAATTAGAAGAAACACAGCCTCATGGGACAGA 
AATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAiikAAAGAAATTGAA 
AATTGCCTTGCAGATATTTAGGTACAi\TGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGC 
ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 
GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 
AATTCAATCAGTCCATAGAGACGAACJ^GAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 
GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 
AAAAA 
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MKTIQPKiymSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNR 
VTRVAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPK 
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL 
EIQGITREQSGDYECSASNDVAAPVVRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAV 
PSAEFQWYKDDKRLIEGKKGVKVEm^PFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIML 
FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF 



Signal peptide: 

amino acids 1-28 
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GAAAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTC 
ACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT 
CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 
TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT 
GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 
CGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 
ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 
TCTTCAGATATCTCCATTAATGAAGGJGAACAATATTAGCCTCACCTGCATAGCAACTGGTAG 
ACCAGAG 



mm 
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ATGGCTGGTGACGGCGGGGCCGGGCAGGGQACCGGGGCCGCGGCCCGGGAGCGGGCCaGCTGCCGGGAGCCCTGA 

ATCACCGCCTGGCCCGACTCCACCATGZy^CGTCGCGCTGCaGGAGCTGGGAGCTGGCAGC^ 

AaGGGGACAAGACAGCTGTTAGGCTCACGCa.CGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCT 

GCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACaiCAGAGACCCATCCCACAGCaCCTGCCTTAC^ 

GAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTTTAC 

CAGTTCTCCTGTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGC 

CTCTGGGACCAAAACCAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCaACTCCAGCAGTGAAGC^^ 

CAGAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTQAGGAGCTGGGAGCCCAGCCACTGAGA 

GACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTQAAG 

GCyVGTAGCAGGQACCTACAGGGCCACCCCATrCTTCACCGTCTACmTCAGTGCCGACTCTAAGAGTTCCAACAGC 

AATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAG 

AAAGTGCTCACTGCCTATCTGGATTACATGGAG6AACTGGGGATGCTGCTGGGTGGGCGGCCCACCTCCACGAGG 

GAGCAGATGCAGCAGGTGCTGGAGTTGGAGATACAGCTGGCCAACATCACAGTGCCCCAGGACCAGCGGCGCGAC 

GAGGAGAAGATCTACCACAAGATQAG<a.TTTCGGAGCTGCaGGCTCTGGCGCCCTCC».TGGACTGGCTTGAGTTC 

CTGTCTTTCTTGCTGTCACCATTGGAGTTGAGTGACTCTGAGCCTGTGGTGGTGTATGGGATGGATTATTTGCAG 

CAGGTGTCAGAGCTCATCAACCGCACGGAACiCAAGaVTCCTGAACAATTACCTGATCTGGAACCTGGTGCA^ 

ACAACCTCAAGCCTGGACCGACGCTTTGAGTCTGCACAAGAGaAGCTGCTGGAGACCCTCTATGGCACT^ 

TCCTGTGTGCCGAGGTGGCAGACCTGCATCTCCAACACGGATGACGCCCTTGGCTTTGCTTTGGGGTCACTCTTC 

GTGftAGGCCyiCGTTTGACCGGCAAAGCAAAG?^TTGCAGAGGGGATGATCAGCGAAATCCGGACCGC^^ 

GAGGCCCTGGQACAGCTGGTTTGGATGGATGAGAAGACCCGCCAGGCAGCCAAGGAGAAAGCAGATGCCATCTAT 

GATATGATTGGTTTCCa^GACTTTATCCTGQi^GCCCAAAGAGCTGGATGATGTTTATGACGGGTACGAAATTTCT 

GAAGATTCTTTCTTCCAAAACATGTT6AATTTGTACAACTTCTCTGCCAAGGTTATGGCTGACCAGCTCCGCAAG 

CCTCCCAGCCGAQACCAGTGGAGCATGACCCCCCAGACAGTGAATGCCTACTACCTTCCAACTAAGAATGAGATC 

GTCTTCCCCGCTGGCATCCTGCAGGCCCCCTTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATC 

CGGCCCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCyVTGGAGGAACAGTACAATCAA 

TACCa^GGTO^TGGGGAGAGGCTC^CGGCCCKraiGACGCTGGGGGAGAACATTACTGACAACGGGGGG 

GCTGCCTACaATGCTTACAAAGCATGGCTGAC3AAAGCATGGGGAGGAGCaGCAACT^^ 

^^S^^P^^^'^^^^^T'^C^'T^^^^TTTGCCaiGGTGTGGTGCTCGGTCCGCACACCAGAGAGCT^ 

CTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGG 

CACTTCGGCTGCCCTGTCGGCTCCCCCATGA^ICCCAGGGCAGCTGTGTGAGGTGTGGTAGACCTGGATCAGGGG^ 

ATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACAGGTGACATGAGTACAGACCCTCCTCAATCACCAC^^ 

J^S^^^f^T'^^^^TGCCCCTGCCTCCAGCyVGAGCCCCCACCATTCACTGTGAC^^^ 

GCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCC 
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MNVALQELGAGSNVGFQKGTRQLLGSRTQLELVLAGASLLLAALLLGCLVALGVQYHRDPSH 
STCLTEACIRVAGKI LES LDRGVS PCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 
ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN 
FMEVLKAVAGTYRATPFFTVYISAD£5KSSNSNVIQVDQSGLFLPSRDYYLNRTANEKYLTAY 
LDYMEELGMLLGGRPTSTREQMQQVIjELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP 

slvdwleflsfllsplelsdsepvvvli'gmdylqqvselinrtepsilnnyliwnlvqkttssl 
drrfesaqeklletlygtkkscvprpjqtcisntddalgfalgslfvkatfdrqskeiaegmi 
seirtafeealgqlvwmdektrqaakekadaiydmigfpdfilepkelddvydgyeisedsf 
fqimlnlynfsakvmadqlrkppsre)qwsmtpqtvnayylptkneivfpagilqapfyar]m 
pkalnfggigwmghelthafddqgreydkegnlrpwwqneslaafrnhtacmeeqymqyqv 
ngerlngrqtlgenitdngglkaaynaykawlrkhgeeqqlpavgltnhqlffvgfaqvwcs 
vrtpessheglvtdphsparfrvlgtlsnsrdflrhfgcpvgspmnpgqlcevw 



Type II Transmembrane domain: 

amino acids 32-57 
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GCCCGGCCCTCCGCCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTCCCTCCCTCCTCCC 

CAGCTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCGCTGCTGCTCCTCGGGCTGCTGCTGCT 

CGGCTCCCGGCCGGCCCGCGGCGCCGGCCC^iGAGCCCCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCTGCC 

CGTTCGGGGAGCGGCAGGTAGGTGGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGC 

CAGCCCGQAGGGGGCGCGGGGCGCaGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAGGGC 

GCGGTGCCTGGGACCCGGGACCCGCGGGCAGCCCCCGGGGCGGCACACGGCGCGAGCTGGGCAGCGGCCTCCAGC 

CAAGCCCGTCCCCGCAGGCTGOICCTTCGGCGGGAAGGTCTATGCCTTGGACGAGACGTGGCACCCGG^ 

GGAGCCATTCGGGGTGATGCGCTGCGTGCTGfTGCGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGG 

CSVGGGTC^lGCTGCAAGAACATCAAACCAGAGTGCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCTC^ 

ctgctgccagacctgcccccaggacttcgtggcgctgctgacagggccgaggtcgcaggcggtggcacgagcccg 
agtctcgctgctgcgctctagcctccgcttctctatctcctacaggcggctggaccgccctaccaggatccgctt 

CTCAGACTCCAATGGCAGTGTCCTGTTTGA6CACCCTGCAGCCCCCACCCAAGATG6CCTGGTCTGTGGGGTGTG 

GCGGGCAGTGCOTCGGTTGTCTCTGCGGCTCCTTAGGGCAGAACAGCTGCyVTGTGGCACTTGTGACACTCACTC^ 

CCCTTCaGGGGAGGTCTGGGGGCCTCTCATCCGGCaCCGGGCCCTGTCCCCAGAGACOTTCaGTGCCaTC^^ 

TCTAGAAGGCCCCCACCAGCaGGGCGTAGGGGGCaVTCa^CCCTGCTCACTCTCaGTGACACAGAGGAOT 

TTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGTTCCCTTGAGGCTCCAGATTCTACACCAGGGGCA 

GCTACTGCGAGAACTTCAGGCO^TGTCTCAGCCCAGGAACCAGGCTTTGCTGAGGTGCTGCCC^ 

CCAGGAGATGGACTGGCTGGTGCTGGGGGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCAGGGCTGCGCAT 

CAGTGGACACATTGCTGCCAGGaAGAGCTGOGACGTCCTGCAAAGTGTCCTTTGTGGGGCTAATGCCCTGATCCC 

AGTCCAAACGGGTGCTGCCGGCTCy^GCCaGCCTCACTCrGCTAGGAAATGGCNCCCTGATCCTCCAGGTGC^ 

GGTAGGGACAACCAGTGAGGTGGTGGCCATQi^CACTGGAAACCAAGCCTCAGCGQAGGGATCAGCCa^^ 

GT6CCACATGGCTGGCCTATCCTCCCCTGCCCCCAGGCCGTGGGTATCTGCCCTGGGCTGGGGTGCCCGAGGGGC 

TCATATGCTGCTGCAGAATGAGCTCTTCCTGi\ACGTGGGCACCAAGGACTTCCCAGACGGAGAGCTTCGGGGGCA 

ACGTGGCTGCCCTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGTGCTACC 

CCCTGTGAAGAGCCaLAGCAGCAGGGCaiCGCCTGGCTTTCCTTGGATACCCACTGTCACCTGCACTATGA^ 

GCTGGCTGGGCTTGGTGGCTCAGAACAAGGa^CTGTCACTGCCCACCTCCTTGGGCCTCCTGGAACGCCAGGGCC 

TCGGCGGCTGCTGAAGGGATTCTATGGCTCACSAGGCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCG 

GCACCTGGCAAAAGGCATGGCTTCCCTGATGATC^CCACCAAGGTAGCCCCaGAGGGGAGCTCCGAGGGCAGCCT 

CTCCTCCCAGGTGCAC3^TAGCCaACCAATGTC3AGGTTGGCGGACTGCGCCTGGAGGCGGCCGGGGCCGAGGGGGT 

GCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCTCCCGGCCCTAGCGCCCGC 

CAAACCTGGTGGTCCTGGGCGGCCCCGAGACCCCAACAC:ATGCTTCTTCGAGGGG<:a^GCAGCGCCCCCACG<^ 

TCGCTGGGCGCCCAACTACGACCCGCTCTGCTCACTCTGCACCTGCCAGAGACGAACGGTQATCTGTGACCCGGT 

GGTGTGCCCACCGCCCAGCTGCCCACaVCCCGCiTGCSLGGCTCCCQACCAGTGCTGCCCTGTTTGCCCTGGCTGCTA 

TTTTGATGGTGACCGGAGCTGGCGGGCAGCGGkSTACGCGGTGGCACCCCGTTGTGCCCCCCTTTGGCTTAATTAA 

GTGTGCTGTCTGCACCTGCAA6CAGGGGGGai.CTG6AGAGGTGCACTGTGAGAAGGTGCAGTGTCCCCGGCTGGC 

CTGTGCCCaGCCTGTGCGTGTCAACCCCACCC3ACTGCTGCAAAasLGTGTCCa.GGTGAGGCCavCCCCC^ 

GQACCCCAT6CAGGCTQATGGGCCCCGGGGC1'GCCGTTTTGCTGGGCa.6TGGTTCCCy^GAQAGTCaGAGCT 

CCCCTCAGTGCCCCCGTTTGGAGAGATGAGCTGTATCACCTGCAQATGTGGGGTAAGTGGGGAGCAGAGGCTTGT 

GTGAGGTGGGTACTGGQAGCCTGGTCTGGAGT'AGGGAGACCTTCCCAGGGAGGTCCCTGAAGAAGCTGAAGGTCA 

CTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCTCTGTCTTGTACCAGGCAGGGGTGCCTCACTGTGAGC 

GGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGGAAGGAGAGTCGATGCTGTTCCCGCTGCACXSGCCCACC 

GGCGGCGTAAGTGAGGGAGTCCaGGGTC:aGC».GCTGTGAGTGGAGGGCTCACCTGCCTGTGGGACTCCTGATCAG 

GGAAGGGAGCS^CTO^OTGTGTGCAGGAACAGTGCAGCCTGCCTCACAAGTGCCATTCCAATCC^ 

ACCTGGTGGAATTGTTATTTATGACCTTTTCTTTACAAATGAGATTTCTGAAGCTCAGAGAAATTAAGCAAC^ 

ATGAAGGTCACCCaGCTGTGTGCACTGACCTGTTTAGAAAATACTGGCCTTTCTGGGACCAAGGCAGGGATGCTT 

TGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCACTCCCTCTCCCCTCCTCCAACATTCCCTCCCTTCTGTCTCC 

AGCAGCCCCAGAGACCAGAACTGATCCAGAGCTGGAQAAAGAAGCCGAAGGCTCTTAGGGAGCAGCCa^^ 

AAGTGACCS^GAGGATGGGGCCTGAGCTGGGGfiAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGGGAAG 

CCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAACCACaGCrCCAC^^ 

GAGAGGCAGCTGGGCCAGACCGAGGTCACaGCCACTCCAAGTCCTGCCCTGCCACCCTCGGCCTCT^ 

GCCCCACCCCTTTCTTCCTGTAO^TAATGTCaCTGGCTTGTTGGGATTTTTAATTTATCTTCACTCSlGCACCAAG 

GGCCCCGQACACTCCACTCCTGCTGCCCCTGAGCT6AGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAAC 

ATTTCTTTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGGAGAAGGG 

GCa!JGAGAGTAGGAGGTGAGAGAGAGGAGCTCT(3ACACTTGGGGAGCTGaAAGAGACCTGGAGAGGCAGAGGATAG 

CGTGGCmTTGGCTGGCATNCCTGGGTTCCGai^GAGGGGCTGGGGATGGTTOTTGAGATGGTCTAGAGACTCAAG 

AATTTAGGGAAGTAGAAGCAGGATTTTGACTG5AGTTTAGTTTCCCACATCGCTGGCCTGTTTGCTGACTTCAT6 

TTTGAAGTTGOTCCAQAGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCr^ 

TTCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCC 
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GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 
CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 
TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG 
GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC 
TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 
TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 
GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 
GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 
CAGGCCATACTGAAGCACCTGCTTG.AAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 
GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 
AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 
GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 
CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 
GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 
ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 
TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 
CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 
TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 
GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 
AGATAGGCTGATAACAAAGCTCACAS.GAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 
AGCAAAGTCTTCCTGAAGAGATTGCJV.TTTGAGCCAGGTCCTGTAG 
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ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 
TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 
TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 
CCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 
TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 
TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 
GGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 
GCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 
CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 
ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 
TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 
TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 
GGTGACATGAGTACAGACCCTCCTG^ATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 
GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 
GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT 
CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 
TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 
TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 
CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 
TGCCCACTGTTTCCCTGGGCTGAGAC3GGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 
GTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGAAAGCAGATA 
GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 
TGGCCCTTATAGGACC 
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CCCACGCGTCCGAGCCGCCCGAGAATTAGACACACTCCGGACGCGGCCAAAAGCAACCGAGA 
GGAGGGGAGGCAAAAACACCGAAA2^ACAAAAAGAGAGAAACAACACCCAAC71ACTGGGGTGG 

ggggaagaaagaaagaaaagaaacccacccacccaccaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaatcctgtggcgcgccgcctggttcccgggaagactcgccagcaccaggggg 
tqggggagtgcgagctgaaagctgctggagagtgagcagccctagcagggatggac atga tg 
ctgttggtgcagggtgcttgttgctcgaaccagtggctggcggcggtgctcctcagcctgtg 
ctgcctgctaccctcctgcctcccggctggacagagtgtggacttcccctgggcggccgtgg 
acaacatgatggtcagaaaaggggacacggcggtgcttaggtgttatttggaagatggagct 
tcaaagggtgcctggctgaaccggtcaagtattatttttgcgggaggtgataagtggtcagt 
ggatcctcgagtttcaatttcaacattgaataaaagggactacagcctccagatacagaatg 
tagatgtgacagatgatggcccata(::a.cgtgttctgttcagactcaacatacacccagaaca 
atgcaggtgcatctaactgtgcaagttcctcctaagatatatgacatctcaaatgatatgac 
cgtcaatgaaggaaccaacgtcactcttacttgtttggccactgggaaaccagagccttcca 
tttcttggcgacacatctccccatcagcaaaaccatttgaaaatggacaatatttggacatt 
tatggaattacaagggaccaggctgcjggaatatgaatgcagtgcggaaaatgctgtgtcatt 
cccagatgtgaggaaagtaaaagttgttgtcaactttgctcctactattcaggaaattaaat 
ctggcaccgtgacccccggacgcagtggcctgataagatgtgaaggtgcaggtgtgccgcct 
ccagcctttgaatggtacaaaggagagaagaagctcttcaatggccaacaaggaattattat 
tcaaaattttagcacaagatccattctcactgttaccaacgtgacacaggagcacttcggca 
attatacctgtgtggctgccaa.caagctaggcacaaccaatgcgagcctgcctcttaaccct 
ccaagtacagcccagtatggaattaccgggagcgctgatgttcttttctcctgctggtacct 
tgtgttgacactgtcctctttcaccjlgcatattctacctgaagaatgccattctaca ataaa 
ttcaaagacccataaaaggcttttaaggattctctgaaagtgctgatggctggatccaatct 
ggtacagtttgttaaaagcagcgtgggatataatcagcagtgcttacatggggatgatcgcc 
ttctgtagaattgctcattatgtaa7i.tactttaattctactcttttttgattagctacatta 
ccttgtgaagcagtacacattgtcctttttttaagacgtgaaagctctgaaattacttttag 
aggatattaattgtgatttcatgtttgtaatctacaacttttcaaaagcattcagtcatggt 
ctgctaggttgcaggctgtagtttacaaaaacgaatattgcagtgaatatgtgattctttaa 
ggctgcaatacaagcattcagttccctgtttcaataagagtcaatccacatttacaaagatg 

CATTTTTTTCTTTTTTGATAAAAAAGCAAATAATATTGCCTTCAGATTATTTCTTCAAAATA 
TAACACATATCTAGATTTTTCTGCTTGCATGATATTCAGGTTTCAGGAATGAGCCTTGTAAT 
ATAACTGGCTGTGCAGCTCTGCTTCTCTTTCCTGTAAGTTCAGCATGGGTGTGCCTTCATAC 
AATAATATTTTTCTCTTTGTCTCCAA.CTAATATAAAATGTTTTGCTAAATCTTACAATTTGA 
AAGTAAAAATAAACCAGAGTGATCAAGTTAAACCATACACTATCTCTAAGTAACGAAGGAGC 
TATTGGACTGTAAAAATCTCTTCCTGCACTGACAATGGGGTTTGAGAATTTTGCCCCACACT 
AACTCAGTTCTTGTGATGAGAGACAATTTAATAACAGTATAGTAAATATACCATATGATTTC 
TTTAGTTGTAGCTAAATGTTAGATCCA.CCGTGGGAAATCATTCCCTTTAAAATGACAGCACA 
GTCCACTCAAAGGATTGCCTAGCAATACAGCATCTTTTCCTTTCACTAGTCCAAGCCAAAAA 
TTTTAAGATGATTTGTCAGAAAGGGCACAAAGTCCTATCACCTAATATTACAAGAGTTGGTA 
AGCGCTCATCATTAATTTTATTTTGTGGCAGGTATTATGACAGTCGACCTGGAGGGTATGGA 
TATGGATATGGACGTTCCAGAGACTATAATGGCAGAAACCAGGGTGGTTATGACCGCTACTC 
AGGAGGAAATTACAGAGACAATTATGACAACTGAAATGAGACATGCACATAATATAGATACA 
CAAGGAATAATTTCTGATCCAGGATCGTCCTTCCAAATGGCTGTATTTATAAAGGTTTTTGG 
AGCTGCACTGAAGCATCTTATTTTATAGTATATCAACCTTTTGTTTTTAAATTGACCTGCCA 



AGACAAATTATGGGACGTTTGTCAAAflAAAAAAAAAAAAAAAAAAAAAAA 
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MMLLVQGACCSNQWLAAVLLSLCCLLPSCLPAGQSVDFPWAAVDNMMWKGDTAVLR^ 
GASKGAWLNRSS 1 1 FAGGDKWSVDPRVS I STLNKRDYSLQIQNVDVTDDGPYTCSVQTQHTP 
RTMQVHLTVQVPPKI YDI SNDMTVNEGTNVTLTCLATGKPEPS I S WRHI SPS AKPPENGQYL 
DIYGITRDQAGEYECSAENAVSFPDWKVKVWNFAPTIQEIKSGTVTPGRSGLIRCEGAGV 
PPPAFEWYKGEKKLFNGQQGI I IQNFSTRS ILTVTNVTQEHFGNYTCVAANKLGTTNASLPL 
NPPSTAQYGITGSADVLFSCWYLVLTLSSFTSIFYLKNAILQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 326-345 

N-glycosylation sites. 

amino acids 71-75, 153-157, 273-277, 284-288, 292-296, 305-309 

Casein kinase II phosphorylation site. 

amino acids 147-151, 208-212, 224-228 

Tyrosine kinase phosphorylation site. 

amino acids 178-186 

N-myristoylation sites. 

amino acids 7-13, 63-70, 67-73, 151-157, 239-245, 291-297, 
302-308, 319-325 

Myelin PO protein; 

amino acids 92-121 



Iff 



Him 
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AGTGGTTCGATGGGAAGGATCTTTCTCCAAGTGGTTCCTCTTGAGGGGAGCATTTCTGCTGG 
CTCCAGGACTTTGGCCATCTATAAAGCTTGGCAATGAGAAATAAGAAAATTCTCAAGGAGGA 
CGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAA 
TCAATGTTCCAAAGCCCAAGAGGAGiLAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATC 
TACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGC 
GCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 
CCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTG 
CAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACTTGCTGCAGCGGGT 
AGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAACAAGGCGCCCCAGGTC 
TTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCCCGCCGGGACCACCTGCT 
GAGAAGGGAGCCAAGGGGGCTATGGGiACGAGATGGAGCAACAGGCCCCTCGGGACCCCAAGG 
CCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAG 
GAGCCACTGGCACCCCAGGACCCCAA.GGAGAGAAGGGCAGCA2yiGGCGATGGGGGTCTCATT 
GGCCCAAAAGGGGAAACTGGAA.CTA/LGGGAGAGAAAGGAGACCTGGGTCTCCCAGGAAGCAA 
AGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAGGGGAGTA 
AAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGAT 
CAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGC 
CAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGA 
GTCCAGGAGCCACAGGCCTGAAAGG^iAGCAAAGGGGACACAGGACTTCAAGGACAGCAAGGA 
AGAAAAGGAGAATCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGG 
GCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAG 
GATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCC 
GTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGG 
GACAATTTGCGATGACGAGTGGCAAliATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTT 
ACTCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 
GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCA 
TGACTGCAGCCACGAGGAGGACGCAGGCGTGGAGTGCAGCGTCTGACCCGGAAACCCTTTCA 
CTTCTCTGCTCCCGAGGTGTCCTCGGGCTCATATGTGGGAAGGCAGAGGATCTCTGAGGAGT 
TCCCTGGGGACAACTGAGCAGCCTCTGGAGAGGGGCCATTAATAAAGCTCAACATCATTGA 
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></usr/seqc3b2/sst/DNA/Dnaseqs.full/ss.DNA58886 
xsubunit 1 of 1, 520 aa, 1 stop 
xMW: 52658, pi: 9.16, NX(S/T): 3 

MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVWIYLILLTAGAGL 
LWQVLNLQARLRVLEMYFLmDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQLTPJVR 
VSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRD 
GATGPS6PQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGE 
KGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPG 
PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPA 
GVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGR 
AEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLW 
SCTKNSWGHHDCSHEEDAGVECSV 

Transmembrane domain: 

amino acids 47-66 (type II) 

N-glycosylation sites. 

amino acids 43-47, 83-87, 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 432-440 

N-myristoylation sites. 

amino acids 41-47, 178-184, 253-259, 274-280, 340-346, 346-352, 
400-406, 441-447, 475-481, 490-496, 515-521 

Amidation site. 

amino acids 360-364 

Leucine zipper pattern. 

amino acids 56-78 

Speract receptor repeat 

amino acids 422-471, 488-519 

Clq domain proteins. 

amino acids 151-184, 301-334, 316-349 
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CCCACGCGTCCGAAGGCAGACAAAGGTTCATTTGTAAAGAAGCTCCTTCCAGCACCTCCTCT 
CTTCTCCTTTTGCCCAAACTCACCC\GTGAGTGTGAGCATTTAAGAAGCATCCTCTGCCAAG 
ACCAAAAGGAAAGAAGJyWlAGGGCClAAAAGCCAAAATGAAACTGATGGTACTTGTTTTCAC 
CATTGGGCTAACTTTGCTGCTAGGAGTTCAAGCCATGCCTGCAAATCGCCTCTCTTGCTACA 
GAAAGATACTAAAAGATCACAACTGTCACAACCTTCCGGAAGGAGTAGCTGACCTGACACAG 
ATTGATGTCAATGTCCAGGATCATTI'CTGGGATGGGAAGGGATGTGAGATGATCTGTTACTG 
CAACTTCAGCGAATTGCTCTGCTGCCCAAAAGACGTTTTCTTTGGACCAAAGATCTCTTTCG 
TGATTCCTTGCAACAATCAATGAGA?LTCTTCATGTATTCTGGAGAACACCATTCCTGATTTC 
CCACAAACTGCACTACATCAGTATAS.CTGCATTTCTAGTTTCTATATAGTGCAATAGAGCAT 
AGATTCTATAAATTCTTACTTGTCTjyiGACAAGTAAATCTGTGTTAAACAAGTAGTAATAAA 
AGTTAATTCAATCTAAAAAAAAAAAAA 



FIGURE 234 

</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA52758 
<subunit 1 of 1, 98 aa, 1 stop 
<MW: 11081, pi: 6.68, NX(S/T): 1 

MKLMVLVFTIGLTLLLGVQAMPANRLSCYRKILKDHNCHNLPEGVADLTQIDVNVQDHFWDG 
KGCEMI CYCNFSELLCCPKDVFFGPKI SFVI PCNNQ 

Important features : 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-76 

Tyrosine kinase phosphorylation site. 

amino acids 63-71 
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CCCACGCGTCCGCGGACGCGTGGGCTGGACCCCAGGTCTGGAGCGAATTCCAGCCTGCAGGG 
CTGATAAGCGAGGCATTAGTGAGATTGAGAGAGACTTTACCCCGCCGTGGTGGTTGGAGGGC 
GCGCAGTAGAGCAGCAGCACAGGCGCGGGTCCCGGGAGGCCGGCTCTGCTCGCGCCGA GATG 
TGGAATCTCCTTCACGAAACCGACTCGGCTGTGGCCACCGCGCGCCGCCCGCGCTGGCTGTG 
CGCTGGGGCGCTGGTGCTGGCGGGTGGCTTCTTTCTCCTCGGCTTCCTCTTCGGGTGGTTTA 
TAAAATCCTCCAATGAAGCTACTAACATTACTCCAAAGCATAATATGAAAGCATTTTTGGAT 
GAATTGAAAGCTGAGAACATCAAGA5.GTTCTTACATAATTTTACACAGATACCACATTTAGC 
AGGAACAGAACAAAACTTTCAGCTTGCAAAGCAAATTCAATCCCAGTGGAAAGAATTTGGCC 
TGGATTCTGTTGAGCTAGCTCATTATGATGTCCTGTTGTCCTACCCAAATAAGACTCATCCC 
AACTACATCTCAATAATTAATGAAGATGGAAATGAGATTTTCAACACATCATTATTTGAACC 
ACCTCCTCCAGGATATGAAAATGTTTCGGATATTGTACCACCTTTCAGTGCTTTCTCTCCTC 
AAGGAATGCCAGAGGGCGATCTAGTGTATGTTAACTATGCACGAACTGAAGACTTCTTTAAA 
TTGGAACGGGACATGAAAATCAATTGCTCTGGGAAAATTGTAATTGCCAGATATGGGAAAGT 
TTTCAGAGGAAATAAGGTTAAAAATGCCCAGCTGGCAGGGGCCAAAGGAGTCATTCTCTACT 
CCGACCCTGCTGACTACTTTGCTCCTGGGGTGAAGTCCTATCCAGACGGTTGGAATCTTCCT 
GGAGGTGGTGTCCAGCGTGGAAATATCCTAAATCTGAATGGTGCAGGAGACCCTCTCACACC 
AGGTTACCCAGCAAATGAATATGCTTATAGGCGTGGAATTGCAGAGGCTGTTGGTCTTCCAA 
GTATTCCTGTTCATCCAATTGGATACTATGATGCACAGAAGCTCCTAGAAAAAATGGGTGGC 
TCAGCACCACCAGATAGCAGCTGGAGAGGAAGTCTCAAAGTGCCCTACAATGTTGGACCTGG 
CTTTACTGGAAACTTTTCTACACAAAAAGTCAAGATGCACATCCACTCTACCAATGAAGTGA 
CGAGAATTTACAATGTGATAGGTACI'CTCAGAGGAGCAGTGGAACCAGACAGATATGTCATT 
CTGGGAGGTCACCGGGACTCATGGGTGTTTGGTGGTATTGACCCTCAGAGTGGAGCAGCTGT 
TGTTCATGAAATTGTGAGGAGCTTTG3AACACTGAAAAAGGAAGGGTGGAGACCTAGAAGAA 
CAATTTTGTTTGCAAGCTGGGATGCAGAAGAATTTGGTCTTCTTGGTTCTACTGAGTGGGCA 
GAGGAGAATTCAAGACTCCTTCAAGAGCGTGGCGTGGCTTATATTAATGCTGACTCATCTAT 
AGAAGGAAACTACACTCTGAGAGTTGATTGTACACCGCTGATGTACAGCTTGGTACACAACC 
TAACAAAAGAGCTGAAAAGCCCTGATGAAGGCTTTGAAGGCAAATCTCTTTATGAAAGTTGG 
ACTAAAAAAAGTCCTTCCCCAGAGTTCAGTGGCATGCCCAGGATAAGCAAATTGGGATCTGG 
AAATGATTTTGAGGTGTTCTTCCAACGACTTGGAATTGCTTCAGGCAGAGCACGGTATACTA 
AAAATTGGGAAACAAACAAATTCAGCGGCTATCCACTGTATCACAGTGTCTATGAAACATAT 
GAGTTGGTGGAAAAGTTTTATGATCCAATGTTTAAATATCACCTCACTGTGGCCCAGGTTCG 
AGGAGGGATGGTGTTTGAGCTAGCCAATTCCATAGTGCTCCCTTTTGATTGTCGAGATTATG 
CTGTAGTTTTAAGAAAGTATGCTGACAAAATCTACAGTATTTCTATGAAACATCCACAGGAA 
ATGAAGACATACAGTGTATCATTTGATTCACTTTTTTCTGCAGTAAAGAATTTTACAGAAAT 
TGCTTCCAAGTTCAGTGAGAGACTCCAGGACTTTGACAAAAGCAACCCAATAGTATTAAGAA 
TGATGAATGATCAACTCATGTTTCTGGAAAGAGCATTTATTGATCCATTAGGGTTACCAGAC 
AGGCCTTTTTATAGGCATGTCATCTATGCTCCAAGCAGCCACAACAAGTATGCAGGGGAGTC 
ATTCCCAGGAATTTATGATGCTCTGTTTGATATTGAAAGCAAAGTGGACCCTTCCAAGGCCT 
GGGGAGAAGTGAAGAGACAGATTTATGTTGCAGCCTTCACAGTGCAGGCAGCTGCAGAGACT 
TTGAGTGAAGTAGCCTAAGAGGATTTTTTAGAGAATCCGTATTGAATTTGTGTGGTATGTCA 
CTCAGAAAGAATCGTAATGGGTATATTGATAAATTTTAAAATTGGTATATTTGAAATAAAGT 
TGAATATTATATATAA 
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></usr/seqdb2/sst/DNA/Dnaseqs . full/ss .DNA52756 
xsiibunit 1 of 1, 750 aa, 1 stop 
xMW: 84305, pi: 6.93, NX(S/T): 10 

MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIKSSNEATNITPKHNMKAFL 
DELKAENI KKFLHNFTQI PHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLLSYPNKTH 
PNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEGDLVYVNYARTEDFF 
KLERDMKINCSGKIVIARYGKYFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNL 
PGGGVQRGNILNLNGAGDPLTPGYPA]!JEYAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMG 
GSAPPDSSWRGSLKVPYNVGPGFTGNFSTQKYKMHIHSTNEVTRIYNVIGTLRGAVEPDRYV 
ILGGHRDSWVFGGIDPQSGAAWHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEW 
AEENSRLLQERGVAYINADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYES 
WTKKSPSPEFSGMPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYET 
YELVEKFYDPMFKYHLTVAQVRGGMVFELANS IVLPFDCRDYAWLRKYADKI YS I SMKHPQ 
EMKTYSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP 
DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQAAAE 
TLSEVA 

Signal sequence: 

amino acids 1-40 

N-glycosylation sites. 

amino acids 76-80, 121-125, 140-144, 153-157, 195-199, 336-340, 
459-463, 476-480, 638-642 

Tyrosine kinase phosphorylation sites. 

amino acids 363-372, 605-613, 606-613, 617-626 

N-myristoylation sites. 

amino acids 85-91, 168-174, 252-258, 256-262, 282-288, 335-341, 
360-366, 427-433, 529-535, 707-713 



